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IT) (54) Title: IDENTIFICATION AND VALIDATION OF NOVEL TARGETS FOR AGROCHEMICALS 
00 

(57) Abstract: The invention relates to a method for identifying and validating plant targets for agrochemicals, comprising the steps 
£5 of determining gene or protein expression profiles in function of the progression of an essential biological process in a plant, and the 
subsequent downregulation of expression of said gene or protein in a plant cell. More particularly, the effects of downregulationof the 
^ candidate target gene were directly monitored on plants locally infected with a vector mediating viral induced gene suppression in that 
infected plant area. The invention also relates to isolated plant genes encoding proteins involved in plant growth and development. 
The invention also relates to plants tolerant to agrochemicals such as herbicides or pesticides. 
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IDENTIFICATION AND VALIDATION OF NOVEL TARGETS FOR 

AGROCHEMICALS 

The invention relates to isolated plant genes encoding proteins essential for plant growth and 
development and to methods for identifying and validating these genes/proteins as target 
5 genes/proteins for agrochemicals, such as herbicides. A target for an agrochemical is a gene 
or a protein where the agrochemical interferes with when applied to the target organism. 

For the identification and validation of useful agrochemicals, the agrochemical industry 
traditionally relied on in vivo screening methods wherein chemical compounds were brought 

10 into direct contact with the living target organisms (e.g. plants for herbicide screening, insects 
for insecticide screening, etc.). However due to (i) the dramatic increase in the number of 
compounds that need to be screened to find a successful new agrochemical product, and (ii) 
the need to rely on very small quantities of compound such as are available in a combinatorial 
chemistry based compound libraries, and (iii) the need to identify compounds with a novel 

15 mode of action, the industry has developed a considerable interest in using more efficient and 
faster in vitro screening methods. 

To render such in vitro screening methods more successful, it is essential to carefully select 
the tested target gene/proteins and/or the tested agrochemicals. It has been described that a 

20 more practical in vitro approach for finding new agrochemicals would involve identification of 
target genes/proteins against which the agrochemical compounds could possibly work. For this 
process identification of suitable target genes/proteins, the conventional methods make use of 
gene knock-outs of the target organism. Gene knock-out libraries are generally made as a 
random collection of thousands of gene knock-outs. In these methods it is investigated if the 

25 gene/protein is essential for the growth and/or viability of the organism, since the knockout of 
an essential gene (when present in a homozygous state) leads to a lethal or otherwise 
detrimental effect on the organism. The indication that said gene/protein is essential to the 
organisms makes it a suitable target for an agrochemical. These conventional methods are still 
cumbersome and time consuming because of the use of gene-knockouts. Other techniques 

30 that are useful to estimate the essential character of a gene or its corresponding protein are 
based on the downregulation of said gene or protein for example via anti-sense expression 
technology (WO0 107601). 

To render an in vitro screening for agrochemicals more successful, it is essential to carefully 
35 select the tested target gene/proteins. Therefore a more practical in vitro approach for finding 
new agrochemicals could be a multistep process involving the steps of (1) identification of 
target genes/proteins against which the agrochemical compounds could possibly work, (2) 

1 
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validation of the candidate target gene as being an essential gene/protein for the organism and 
(3) use of these target genes/proteins in an in vitro screening procedure in which the chemical 
compounds are tested. 

It is the aim of the present invention to develop a process for the more efficient identification of 
5 candidate target genes/proteins for agrochemicals, combined with the more efficient validation 
of the target genes/proteins. It is a further aim of the invention to provide this process in order 
to design more efficiently the screening procedure with the agrochemical compound. 

The method of the present invention is based on the direct use of genetic information for 
10 example generated by expression profiling of the candidate target genes/proteins, for the 
identification and the validation of the targets. 

Therefore according to a first embodiment of the present invention, there is now provided a 
method for identifying and validating plant genes/proteins as targets for agrochemicals, said 
15 method comprising the steps of: 

a. determining gene or protein expression profiles during a biological process of a plant or 
plant ceil, said biological process being necessary for the viability or the growth of the 
plant or plant cell; 

b. selecting genes or proteins having altered expression during said biological process, 
20 c. cloning said selected gene or the nucleic acid encoding said protein in its full-length or 

partial form, 

d. incorporating said nucleic acid in a vector designed for downregulation of expression of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant* 
cell. 

25 

The aim of methods of the present invention is the identification of target gene(s)/protein(s) out 
of a broad range of candidate plant genes/proteins. The identification step is achieved by the 
techniques of expression profiling described in the following embodiments. Since the method 
of the present invention can be used for identification of genes/proteins or proteins, the term 

30 "target" as used herein can mean a gene as well as a gene product, namely a protein, 
polypeptide or peptide. With the expression "target for an agrochemical" is meant a protein as 
well as a gene or nucleic acid encoding such protein, and when such target is inhibited, 
stimulated or otherwise disrupted in its normal activity by an agrochemical compound, this 
would lead to a desired effect in a target organism. The invention aims at efficiently identifying 

35 targets for agrochemicals. Said agrochemicals can be herbicides or pesticides as well as 
growth stimulators or growth regulators. 
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Target identification means selecting candidate targets from a larger number of genes/proteins 
or proteins on the basis of certain properties that give such a molecule a higher probability of 
being a suitable target than other molecules which do not exhibit said properties. 
A herbicide target is a protein or gene that when inhibited, stimulated or otherwise disrupted in 
5 its normal activity by a compound would kill the (weedy) target plant or have a strong negative 
effect on its growth, said compound would therefore be a candidate herbicide. An insecticide 
target is a protein or gene that when inhibited, stimulated or otherwise disrupted in its normal 
activity by a compound would kill the insect pest or have a strong negative effect on its growth, 
said compound would therefore be a candidate insecticide. A plant growth regulator (PGR) 
10 target is a protein or gene that when inhibited, stimulated or otherwise disrupted in its normal 
activity by a compound would promote or alter in a desirable way the growth of plant, said 
compound would therefore be a candidate PGR. 

Nowadays a lot of genomic information, e.g. gene sequences, expression profiles, homologies 
15 and putative functionality, is available from genomic sequencing and expression studies in 
several target organisms. It is therefore of interest to develop a new method to identify and 
validate genes/proteins as candidate targets for agrochemicals, such methods being based on 
a direct use of such genomic information. This use of genomic information, e.g. the expression 
level of a gene, allows the selection of a limited set of appropriate candidate genes/proteins. 
20 Only this limited set of genes is then tested in the validation step, contributing to a higher 
efficiency and success rate of the screening procedure for agrochemicals. Furthermore, the 
genetic information, e.g. the functional data of the putative target gene/protein, is used as a 
basis to design more efficiently the in vitro screening procedure with the agrochemical 
compound(s) under investigation. 

25 

The present invention discloses methods that allow for the identification and validation of target 
genes/proteins for agrochemicals out of the broad range of possible genes/proteins and 
proteins. It therefore allows genes or proteins to be selected for the development of suitable in 
vitro screening methods for the screening of novel and efficient agrochemicals. 

30 According to a first step of the methods of the present invention target genes or gene products 
are identified by using transcript profiling of the genomic content of a ceil. By using this 
technique one immediately obtains genomic data (sequences and expression level) as well as 
a functional indication of the candidate target gene or gene product. Thus this method is useful 
for a first identification and selection of possible agrochemical target genes/proteins, since it 

35 provides as a bonus genomic and functional data on the candidate target. A good candidate 
target gene is a gene of which the expression varies significantly over the course of an 
essential biological process of the cell, since that is an indication that the gene/protein is 
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involved in that biological process The present application describes for the first time that the 
determination of an expression profile of a gene during the progression of an essential 
biological process is used to identify possible agrochemical targets. 

5 The expression profiling in the target identification steps of the method of the present invention 
is carried out in function of the progression of a process that is essential for plant growth 
and/or plant development and/or plant viability. In one preferred embodiment of the present 
invention, the essential process that is monitored in the target identification step is the process 
of cell division. Accordingly, in a particular embodiment of the invention, the method to identify 
10 target genes/proteins for agrochemicals is based on the transcript profiling of genes/proteins 
that are specifically involved in cell division. Therefore the invention provides a method as 
mentioned above, wherein said biological process cell division. 

Other biological processes that may be monitored for the identification and validation of 
15 agrochemical targets are for instance processes that are essential for seed germination, leaf 
formation, etc. 

The term expression profiling means determining the time and/or place when or where a gene 
or a protein is active. Particularly for a gene, this is achieved by monitoring the level of 
20 transcripts and therefore in the case of gene expression profiling the term transcript profiling or 
mRNA profiling is used. 

Generally, the expression profiling in the methods of the present invention is carried , out in 
function of the progression of a process that is essential for plant growth and/or development 
and/or plant viability. To achieve this, the process of interest is synchronized in a sufficient 

25 number of cells (for example in a cell culture) or organisms to allow collecting samples for 
expression profiling representing various stages of said process. Target identification then 
consists in selecting those genes or proteins that show significant changes in expression levels 
in function of the progression of the process of interest. It are those genes or proteins that are 
likely to be strongly involved or to be essential in said process. 

30 The term "essential" means that if the gene or the gene product cannot function as normal in 
the cell or organism, this will have significant implication in the cell growth or cell development 
or other vital functions of the cell or organism. 

According to the invention, the expression profiling can be studied at the level of m-RNA, using 
35 transcript profiling techniques, or alternatively at the level of protein, using proteomics-based 
approaches. 

4 
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In one preferred embodiment of the invention, m-RNA profiling is used for identification of 
target genes/proteins and expression levels may be quantified via techniques that are well 
known to the man skilled in the art. For instance, mRNA-profiling can be performed using 
micro-array or macro-array technologies, this method however requires that the gene 
5 sequences are known (full length sequences or at least partial sequences) and are physically 
available for coating on the micro or macro array surface. Standard chips are being 
commercialised for Arabidopsis, and sufficient sequence information is now available for 
different plant species (including rice) to allow sufficient sequence data for this approach. 
Another approach for mRNA profiling is the use of AFLP-based transcript profiling as 
10 described in example 1. In this approach short sequence tags are monitored. In a next step 
these short sequence tags may be matched with full-length genes/proteins if required. Gene or 
protein selection thus be based on either full-length or partial sequences and it is well within 
the realm of the person skilled in the art to find a full length sequence based on the knowledge 
of a partial sequence. 

15 Therefore, one aspect of the invention is the direct use of genetic information to* select 
candidate targets for agrochemicals. As mentioned above this genetic information can be 
generated by a number of techniques. Accordingly, the present invention encompasses a 
method as mentioned above, wherein the expression profiles are determined by means of 
micro-array, macro array or c-DNA-AFLP. 

20 

According to another embodiment of the invention, proteomic based approaches may be used 
to identify candidate target proteins for agrochemicals. 

It is now demonstrated that for the purposes of identifying a target gene for agrochemicals a 
25 synchronized culture of dividing plant cells is used to isolate samples and to monitor the 
expression of the transcripts of those cells during the progression of the cell division. 

Therefore according to a particular embodiment, the invention also encompasses a method for 
the identification and validation of plant agrochemical targets, wherein said gene or protein 
30 expression profiling is based on nucleic acid or protein samples collected from a synchronized 
culture of dividing plant cells. 

In one embodiment of the invention, the samples used for expression profiling are obtained 
from a synchronized culture of rice cells, tobacco cells, Arabidopsis cells or cells from any 
35 other plant species. The cell culture should be synchronized in order to obtain samples 
containing a sufficient amount of cells that are at the same stage of the biological process, so 
that the various samples taken for expression profiling are representative for the various 

5 
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stages of the essential biological process. In a particular embodiment of the present invention 
the samples are obtained from cells that are synchronized for cell division. In a preferred 
embodiment of the invention expression profiling is done on synchronized dividing cells. 
Certain cell lines are particularly suitable for synchronization of cell division, for instance 
5 synchronization of tobacco Bright Yellow-2 cell lines as described in example 1. Therefore 
most preferably, the synchronized cells are tobacco BY2 cells. By using synchronized tobacco 
BY2 cells and performing a cDNA-AFLP-based genome-wide expression analysis, the 
inventors built a large collection of plant cell cycle-modulated genes/proteins. Approximately 
1340 periodically expressed genes/proteins were identified, including known cell cycle control 

10 genes as well as numerous novel genes. A number of plant-specific genes were found for the 
first time to be cell cycle modulated. Other transcript tags were derived from unknown plant 
genes showing homology to cell cycle-regulatory genes of other organisms. Many of the genes 
encode novel or uncharacterised proteins, indicating that several processes underlying cell 
division are still largely unknown. These sequences are presented herein as SEQ ID NO 1 to 

15 SEQ ID NO 785. 

While, according to the invention, the basic criterion for identifying an agrochemical target 
gene or gene product consists in the differential expression levels of the gene or the protein 
observed during the progression of an essential biological progress, secondary selection 
20 criteria can be used and combined with this primary criterion. 

One such secondary criterion may be to make a selection of genes or proteins that are found 
not to exhibit a high degree of homology with genes or proteins from other organisms (such as 
mammals) as this criterion is likely to reduce the probability that the agrochemical compounds 
25 active on the "plant-specific" target genes or gene products would also exhibit toxic effects 
against other organisms, for example mammals. 

Another secondary selection criterion could exist in focussing on a particular phase of the 
essential biological process as mentioned above. For instance, when cell division modulated 
genes/proteins are under investigation as potential agrochemical target genes/proteins, one 

30 could preferably use those cell division modulated genes/proteins which exhibit high 
expression during the G1 phase, S phase, G2 phase or M phase or at the transition stages of 
these phases. In one embodiment of the present invention, the focus may be on the G2/M 
transition phase, since this phase in the plant cell cycle is considered to have more "plant 
specific" elements than other phases of the cell cycle and is therefore more likely to yield plant 

35 specific candidate target genes and proteins. Whereas the core cell cycle genes/proteins and 
the basic regulatory mechanisms controlling cell cycle progression are conserved among 
higher eukaryotes, basic developmental differences between plants and other organisms imply 
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that plant-specific regulatory pathways exist that control cell division. Especially for events 
occurring at mitosis, plants are expected to have developed unique mechanisms regulating 
karyo- and cytokinesis. A typical plant cell is surrounded by a rigid wall and can as such not 
divide by constriction. Instead, a new cell wall between daughter nuclei is formed by a unique 
5 cytoskeletal structure called the phragmoplast, whose position is dictated by another 
cytoskeletal array called the preprophase band. Another major difference between plant and 
animal mitosis is found in the structure of the mitotic spindles: in animals, they are tightly 
centred at the centrosome, whereas in plants they have a diffuse appearance. 
Therefore a suitable second criterion to combine with the first criterion may be to select 
10 genes/proteins that are involved in the mitosis step of the cell cycle and/or that are involved in 
the building of the cell wall during mitosis. 

Likewise a secondary selection criterion to be combined with the first criterion may be the 
selection of genes or proteins from a dicotyledonous plant that do not exhibit a high degree of 
homology with genes or proteins from a monocotyledonous plant (or vice versa). This 
15 secondary criterion is especially relevant when identifying agrochemical target genes or 
proteins with the intention to selectively identify targets that would allow for subsequence 
screening of selective herbicides or plant growth regulators. For instance, this strategy is 
advantageous to find targets and agrochemicals for selective weed control, such as herbicides 
that kill dicotyledonous weeds in monocotyledonous crops or vice versa. 

20 

Therefore according to further embodiments, the present invention encompasses methods as 
mentioned above, wherein the target gene or protein meets any one or more of the above 
mentioned secondary selection criteria, such as being plant specific, being mitosis specific or 
being dicot specific etc. 

25 

The possibility for combination of criteria used for selecting target genes or proteins renders 
the method of the present invention more powerful than classical methods. According to a 
preferred embodiment the technique of the present invention allows identifying genes/proteins, 
to be used as agrochemical target genes/proteins, these genes being genes/proteins that are 
30 involved in cell division and control of cell cycle progression, and these genes being novel and 
these genes being plant specific. Therefore the method of the present invention is 
characterized in that it allows identifying new and unexpected agrochemical targets. 

In the target gene identification step according to the present invention, genes or proteins are 
35 selected for which there is a high probability of being essential. It should be clear that the 
above-mentioned examples are given by way of illustration and are not meant to be limiting in 
any way. 

7 
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Further, according to a second step in the method of the invention, the candidate 
agrochemical target gene or gene product is subsequently validated as being essential for the 
growth and/or development and/or viability of the organism. This is achieved by cloning the 
5 identified candidate target gene in a vector construct designed to downregulate said target 
gene in a plant or plant cell, followed by inoculating the plant with this construct and monitoring 
whether downregulation of the gene results in negative effects on plant growth and/or 
development and/or viability. A valid target gene is a target gene that causes significant effects 
on growth of plants or plant cells when downregulated. The present application describes for 
10 the first time the use of a particularly fast and efficient downregulation method to validate 
possible agrochemical targets. 

Accordingly, the present invention encompasses a method as mentioned above for the 
identification and validation of plant targets for agrochemicals, wherein said downregulation 
1 5 involves a viral-induced gene silencing mechanism. 

Thus, starting from a number of candidate target genes/proteins identified in the first step of 
the method of the invention, the target validation step aims at confirming and demonstrating 
the essential nature of the gene by demonstrating that severe down-regulation of the 

20 expression level of the gene has a significant effect on the organism. 

In particular, when one is interested in developing a screening assay for herbicides, 
downregulation of the candidate target gene in a plant may result in a lethal effect, a severe 
inhibition of plant growth or any other (obviously) negative phenotypic effects. Alternatively, 
when one is interested in developing a screening assay for plant growth regulators, the effect 

25 of downregulating the target gene may be modulation or even stimulation of growth in general 
or modulation or even stimulation of a particular process associated with plant growth and/or 
development and/or architecture and/or physiology and/or biochemistry or any other 
phenotypic effect. 

30 The man skilled in the art will be aware of various methods to achieve downregulation of a 
given gene or protein, such methods include essentially co-suppression based approaches or 
anti-sense based approaches as well as any other method resulting in gene silencing. Other 
examples of downregulation in a cell are well documented in the art and include, for example, 
RNAi techniques, the use of ribozymes etc. Gene silencing may also be achieved by insertion 

35 mutagenesis (for example, T-DNA insertion or transposon insertion) or by gene silencing 
strategies as described by, among others, Angell and Baulcombe, 1998 (WO 98/36083), Lowe 
er a/., 1989 (WO 98/53083), Lederer et a/., 1999 (WO 99/15682) or Wang et a/., 1999 (WO 

8 
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99/53050). Expression of an endogenous gene may also be reduced if the endogenous gene 
contains a mutation. 



The effect of gene downregulation can be observed in stably transformed plants which can be 
5 obtained by means of various well known techniques, these techniques generally involving a 
plant transformation step and a plant regeneration step. 

Genes/proteins which exhibit a severe negative effect when downregulated may however 
significantly reduce transformation and/or regeneration efficiency. Therefore, a relevant 
parameter indicative for the essential nature of the gene, may be a severe reduction in 
10 transformation efficiency when said particular gene is used in a down-regulation construct. In 
order to avoid the (negative) effect on transformation efficiency in the transformation and 
regeneration process, an inducible promoter system can be used. Induction of promoter 
activity can then be applied at a later stage (after transformation) in order to observe the effect 
of gene downregulation once the transformed plant or plantlet started to develop. 

15 

Further, another method for testing the effect of downregulation of a target gene, which can be 
used in the methods of the present invention, is based on a rapid transient transformation 
process and does not rely on the somewhat lengthy process of stable transformation. The use 
of this method for target validation in plants is part of this invention, regardless of whether 
20 target identification has been performed according to this invention. 

Accordingly, in a preferred embodiment, the downregulation method is based on co- 
suppression and on rapid transient transfection of plant cells. The preferred method to validate 
genes/proteins as targets for agrochemicals is based on the cloning of the identified candidate 

25 target gene in a vector construct containing a viral replicase that is involved in the very efficient 
downregulation of the candidate target gene in the infected plant or plant cell via the 
mechanism of co-suppression. One advantage of this method for downregulation, is the fact 
that the infection of the host cells or the plant can be performed locally for example by 
inoculating the vector directly on the leaves. This allows a very fast evaluation of the effect of 

30 downregulating the candidate target since no complete transgenic plants have to be 
generated. Also this technique allows an easy way of monitoring the effect of the 
downregulated candidate target by simply looking at the changes of the infected place, for 
example monitoring the lethal effects on the infected leaf). 

35 Therefore in a preferred embodiment, the downregulation method is based on co-suppression. 
In a more preferred embodiment of the invention this co-suppression technique is fast and 
easy to evaluate the effect of downregulation, so that it is suitable for dealing with high 
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numbers of genes/proteins. This can be achieved by using viral induces gene silencing 
mechanisms (VIGS) and by infecting the plant directly and locally, for example on the leaves. 
Therefore, according to another embodiment, the present invention relates to the use of a viral- 
induced gene silencing system for validating plant targets for agrochemicals. 

5 

This method for severe downregulation via transient expression of the gene in the presence of 
certain viral elements is referred to as "virus-induced gene silencing mechanism" (VIGS) and is 
previously described in Ratcliff et a/., Plant J., 25 237 - 245, 2001. Briefly, virus vectors 
carrying host-derived sequence inserts induce silencing of the corresponding genes/proteins in 

10 infected plants. This virus-induced gene silencing is a manifestation of an RNA-mediated 
defence mechanism that is related to post-transcriptional gene silencing in transgenic plants. 
Ratcliff et a/., developed an infectious cDNA clone of Tobacco rattle virus (TRV) that has been 
modified to facilitate insertion of non-viral sequences and subsequent infection in plants. This 
vector mediates VIGS of endogenous genes/proteins in the absence of virus-induced 

15 symptoms. Unlike the other RNA virus vectors that have been used previously for VIGS, the 
TRV construct is able to target most RNA's in the growing points of the plant. A more detailed 
description of this downregulation mechanism is given in example 2. 

According to particular embodiments of the present invention, the VIGS system is applied in 
20 Arabidopsis or in tobacco for the purposes of validation of a candidate agrochemical target 
gene. 

According to a further preferred embodiment, there is provided a method for validation of a 
candidate agriochemical target gene, wherein the gene is downregulated in a plant via the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 

25 

The present invention relates to a combination of the above-mentioned identification and 
validation steps, which are especially selected so that they lead to an efficient selection of 
candidate target genes for agrochemicals. The outcome of the transcript profiling provides the 
necessary information and forms the basis for the second step, namely the validation of the 

30 target gene via incorporation of the gene sequence in the downregulation construct. The 
combination of these two techniques is especially useful for selecting suitable target 
genes/proteins for agrochemicals in a high throughput fashion. This technique thus overcomes 
the technical limitations of previously described techniques such as the knock-out libraries and 
the antisense strategies .without genetic information of the genes. This new combination offers 

35 a time-saving strategy for identification of a candidate target gene and the more direct 
information output in the form of a real sequence, the immediate cloning of the gene in the 

10 
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downregulation construct and immediate application of the downregulating construct on the 
target organism. 

The combination of these steps offers the unique opportunity to provide many high quality 
target genes/proteins for agrochemicals in a commercially and economically advantageous 
5 way. Furthermore, inherent to the techniques of the present invention is that the qualified 
target genes/proteins are accompanied with the necessary information to design a suitable in 
vitro screening assay with the agrochemical. This information consists of the expression 
characteristics of the genes/proteins and their function and importance in the essentia) 
biological process that was monitored during the transcript profiling. 
10 In this way, the methods of the present invention overcome the practical and commercial 
limitations of the existing techniques. 

Once this level of target validation is reached, the validated target can be selected for the 
development of an appropriate high-throughput in vitro screening method, wherein the 

15 agrochemical is tested. Therefore, the present invention also encompasses a method for 
screening candidate agrochemical compounds, comprising the use of any of the identification 
procedures and/or validation procedures as mentioned above. More particularly, the present 
invention encompasses a method for screening agrochemical compounds, comprising the use 
of any one or more of the sequences represented in SEQ ID NO 1 to 785. 

20 Various methods can be used to develop suitable in vitro assays for screening the chemical 
compounds, depending on what is known about the biological activity of the target gene. For 
example, when the target is an enzyme, measurement of the enzymatic activity of the target 
could form the basis of the in vitro screening assay with the chemical compound. 

25 Therefore, the methods of the present invention, the genes/proteins and the information 
generated by the combined identification and validation methods of the present invention, 
allow one to design and/or fine tune a screening for testing and/or developing agrochemicals 
(for example herbicides). For example if the expression pattern and the role of the target gene 
in the essential biological process is known, it is much easier to set up an in vitro screening 

30 assay to monitor the effect of a candidate herbicide on the target cells. Therefore it is expected 
that much more refined and/or efficient herbicides will be characterized using the methods of 
the present invention. 

Also because of the knowledge of its function, one can further design the screened 
35 agrochemical compound to improve its activity for instance to improve its binding capacity to 
the target. 



11 



BNSDOCID: <WO. 



.030851 15A2_I_> 



WO 03/0851 15 PCT/EP03/03703 

Therefore, the present invention encompasses a method for screening candidate agrochemical 
compounds comprising the use of any of the methods as mentioned above. 

The invention may also be applied for the development of agrochemical (for example herbicide 
5 or pesticide) tolerant plants, plant tissues, plant seeds and plant cells. 

Herbicides that exhibit greater potency can also have greater crop phytotoxicity. A solution to 
this problem is to develop crops that are resistant or tolerant to herbicides. Crop hybrids or 
varieties that are tolerant to the herbicides allow, for instance, for the use of herbicides that kill 
weeds without attendant risk of damaging the crop. Further it should be clear that when a plant 
10 is overexpressing the target of a particular herbicide, the tolerance of said plant against said 
herbicide will also be enhanced. 

Therefore the present invention also relates to the use of the agrochemical (e.g. herbicide) 
target genes/proteins as identified by the method of the present invention for generating 
15 transgenic plants that are tolerant or resistant to an agrochemical (e.g. herbicide). Example of 
genes and gene sequences identified by the combined identification and validation methods of 
the present invention and which can be used as agrochemical target or that can be used to 
obtain herbicide tolerant plants comprise the sequences as represented in any of SEQ ID NOs 
1 to 785. 

20 These sequences are derived from tobacco, but the one skilled in the art can easily find via 
homology search in databases or homology search in a cDNA library the homologues genes of 
other plant species, for instance monocot sequences (e.g the corresponding rice or corn 
sequence), and use them for the same purposes as described herein. These homology 
searches can be done for example with a BLAST program (Altschul et a/., Nucl. Acids Res., 25 

25 3389 - 3402, 1997) on a sequence database such as the GenBank database. Homology 
studies as referred to above can be performed using sequences present in public and/or 
proprietary databases and using several bioinformatics algorithms, well known to the man 
skilled in the art. Methods for the alignment of sequences are well known in the art, such 
methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP uses the algorithm of 

30 Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the alignment of two 
complete sequences that maximizes the number of matches and minimizes the number of 
gaps. The BLAST algorithm calculates percent sequence identity and performs a statistical 
analysis of the similarity between the two sequences. The software for performing BLAST 
analysis is publicly available through the National Centre for Biotechnology Information. 

35 

Further, some of the tobacco sequences identified by the method of the present invention 
might be partial but again, the full-length sequence can easily be found based on the partial 
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sequence. For example "transcript building" can be done based on homology search on EST 
databases, cDNA's or gene predictions. These databases and programs are publicly available 
e.g. http://www.tigr.org/ . 

Therefore the present invention relates to the use of the nucleic acids as identified and 
5 disclosed herein and represented in SEQ ID NO 1 to 785, and also to the use of the full length 
genes regenerated from the partial sequences as well as to the use of the homologues 
sequences isolated from the same or from other plants. 

In another embodiment, the present invention relates to a nucleic acid identified according to 
10 the method of the invention. Thus the invention encompasses an isolated nucleic acid 
identifiable by any of the methods as mentioned above. 

In another embodiment, the invention relates to a nucleic acid identified according to the 
method of the invention, comprising the nucleic acid sequence chosen from the group of SEQ 
15 ID NO 1 to 785 or a full length sequence thereof, or a functional homologue thereof, or a 
functional fragment thereof, or an immunologically active fragment thereof. Thus the invention 
encompasses an isolated nucleic acid, comprising at least part of a nucleic acid sequence 
chosen from the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative 
thereof. 

20 With "a functional fragment" is meant any part of the sequence that is responsible for the 
biological function or for an aspect of the biological function of the nucleic acid sequence. 

Further, the invention encompasses a method for the production of an agrochemical resistant 
plant, comprising the use of any one or more of SEQ ID NO 1 to 785 or a homologue, 
25 functional fragment or derivative thereof or one or more of the proteins encoded by SEQ ID NO 
1 to 785 or a homologue, functional fragment or derivative thereof. 

In one embodiment of the present invention the sequences, the full-length sequences and the 
homologues are used to develop herbicide tolerant plants. 

30 

Further the invention encompasses a plant tolerant to an agrochemical, in which the 
expression level of one or more of the nucleic acids corresponding the SEQ ID NO 1 to 785 or 
the homologue, functional fragment or derivative thereof, is modulated. Further the invention 
encompasses any part or more preferably any harvestable part of these plants. 

35 

Therefore the invention also relates to the use of these sequences, the full-length sequences 
and the homologues as targets for agrochemicals The invention encompasses the use of a 
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nucleic acid as mentioned above or the protein encoded by said isolated nucleic acid as a 
target for an agrochemical compound, preferably, wherein the agrochemical compound is a 
herbicide. 



5 Further, the invention relates to the use of these sequences to develop screening assays for 
the identification and/or development of agrochemicals. The invention encompasses a method 
for screening candidate agrochemical compounds comprising the use of any one or more of 
SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative thereof or one or more 
of the proteins corresponding to SEQ ID NO 1 to 785 or a homologue, functional fragment or 
1 0 derivative thereof. 

The present invention will be further illustrated by the following figures, wherein, 
Figure 1 shows the gene expression profiles obtained by quality-based clustering of all 
transcript tags monitored in a transcript profiling experiment as described in example 1 . Shown 
15 are the trend lines of 16 clusters containing 97% of the genes and covering the entire time 
course as indicated on top. S-phase-specific gene clusters are grouped in A, gene clusters 
with peak expression between S- and M-phase are grouped in B, whereas group C contains 
the M- and G1 -phase-specific clusters. D: Three small clusters of genes with peak expression 
during two cell cycle phases. 

20 

Figure 2 shows the phenotypes of tobacco plants inoculated with a acetolactate synthase 
(SEQ ID NO 18) downregulation construct and phenotypes of tobacco plants inoculated with a 
prohibitin (SEQ ID NO 21) downregulation construct. The phenotypes were observed 12 days 
after inoculation (upper panel) or 17 days after inoculation (lower panel). 

25 

Figure 3 shows the phenotype of tobacco plants inoculated with a B-type CDK (SEQ ID NO 
11) donwregulation contruct. The observations were made 37 days after inoculation. 



Figure 4 shows the sequences identified by the methods of the present invention and 
30 represented by SEQ ID NO 1 to SEQ ID NO 785 

EXAMPLES 
Example 1 

A cDNA-AFLP based expression profiling of sequence obtained from samples of a 
35 synchronized tobacco BY2 cell line system, was used to identify genes that are upregulated 
during the cell cycle, an essential biological process needed for the viability and growth of the 
tobacco cell line system. 
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A genome-wide expression analysis of cell cycle-modulated genes in the tobacco Bright 
Yellow-2 (BY2) cell line was performed. This unique cell line can be synchronized to high 
. levels with different types of inhibitors of cell cycle progression (Nagata et a/., Int. Rev. Cytol., 
132 1 - 30, 1992; Planchais et al., FEBS Lett., 476 78 -83, 2000). Because of the lack of 
5 extensive molecular resources such as genomic sequences, cDNA clones or expressed 
sequence tags (ESTs) for tobacco, a microarray-based approach cannot be used for a 
transcriptome analysis. Therefore, the cDNA-AFLP technology was used to identify and 
characterize cell cycle-modulated genes in BY2. cDNA-AFLP is a sensitive and reproducible 
fragment-based technology that has a number of advantages over other methods for 

10 genome-wide expression analysis (Breyne and Zabeau, Curr. Opin. Plant Biol., 4 136 - 142, 
2001): it does not require prior sequence information, it allows identification of novel genes, 
and it provides quantitative expression profiles. After a detailed analysis, it was found that 
around 10% of the transcripts analyzed is periodically expressed. This comprehensive 
collection of plant cell cycle-modulated genes provides a basis for selecting and validating 

1 5 novel and unexpected agrochemical target genes 

Synchronization of BY2 cells and sampling of material. Tobacco BY2 (Nicotiana tabacum 
L. cv. Bright Yellow-2) cultured cell suspension were synchronized by blocking cells in early 
S-phase with aphidicolin as follows. Cultured cell suspension of Nicotiana tabacum L. cv. Bright 

20 Yellow 2 were maintained as described (Nagata et at., Int. Rev. Cytol., 132 1 - 30, 1992). For 
synchronization, a 7-day-old stationary culture was diluted 10-fold in fresh medium 
supplemented with aphidicolin (Sigma-Aldrich, St. Louis, MO; 5 mg/l), a DNA-polymerase a 
inhibiting drug. After 24 h, cells were released from the block by several washings with fresh 
medium and resumed their cell cycle progression. After the drug had been washed, samples 

25 were taken every hour, starting from the release from the aphidicolin block (time 0) until 1 1 h 
later. The mitotic index was determined by counting the number of cells undergoing mitosis 
under fluorescence microscopy after the DNA had been stained with 5 mg/l 
4',6-diamidino-2-phenylindole (Sigma-Aldrich). DNA content was measured by flow cytometry. 
This was done as follows A subsample was used to check cell cycle progression and synchrony 

30 levels. After the DNA had been stained with 5 mg/l 4',6-diamidino-2-phenylindole 
(Sigma-Aldrich), the mitotic index was determined under fluorescence microscopy by counting 
the number of cells undergoing mitosis. A mitotic peak of approximately 40% was obtained 8 h 
after washing. For flow cytometry, cells were first incubated in a buffered enzyme solution (2% 
cellulase and 0.1% pectolyase in 0.66 M sorbitol) for 20 min at 37°C. After the suspension had 

35 been washed and resuspended in Galbraith buffer (Galbraith et al.. Science, 220 1049 - 1051, 
1983), it was filtered through a 30-^m nylon mesh to purify the DAPI-stained nuclei. The 
fluorescence intensity was measured using a BRYTE HS flow cytometer (Bio-Rad, Hercules, 
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CA). Exit from S-phase was observed 4 h after aphidicolin release and the level of synchrony 
was shown to be sufficiently high throughout the time course. 



RNA extraction and cDNA synthesis. Total RNA was prepared by using LiCI precipitation 
5 (Sambrook et al., 1989) and poly(A + ) RNA was extracted from 500 p.g of total RNA using 
Oligotex columns (Qiagen, Hilden, Germany) according to the manufacturer's instructions. 
Starting from 1 fig of poly(A + ) RNA, first-strand cDNA was synthesized by reverse transcription 
with a biotinylated oligo-dT 25 primer (Genset, Paris, France) and Superscript II (Life 
Technologies, Gaithersburg, MD). Second-strand synthesis was done by strand displacement 
10 with Escherichia coli ligase (Life Technologies), DNA polymerase I (USB, Cleveland, OH) and 
RNAse-H (USB). 

cDNA-AFLP analysis. Five hundred ng of double-stranded cDNA was used for AFLP analysis 
as described (Vos et al., NucL Acids Res., 23 4407 - 4414, 1995; Bachem et a/., Plant J., 9 745 

15 - 753, 1996) with modifications. The restriction enzymes used were BstYl and Mse\ (Biolabs) 
and the digestion was done in two separate steps. After the first restriction digest with one of the 
enzymes, the 3' end fragments were collected on Dyna beads (Dynal, Oslo, Norway) by means 
of their biotinylated tail, while the other fragments were washed away. After digestion with the 
second enzyme, the released restriction fragments were collected and used as templates in the 

20 subsequent AFLP steps. The adapters used were: for BstYl, S'-CTCGTAGACTGCGTAGT-S* 
and 5'-GATCACTACGCAGTCTAC-3\ and for Mse\ t S'-GACGATGAGTCCTGAG-S* and 
5-TACTC AGG ACTCAT-3' ; the primers for BstY\ and Mse\ were 
S'-GACTGCGTAGTGATCtT/C)^^-^ and 5'- G ATG AGTCCTG AGTAAN ^ _ 2 -3' , respectively. For 
preamplifications, a Msel primer without selective nucleotides was combined with a BsfY\ primer 

25 containing either a T or a C as 3' most nucleotide. PCR conditions were as described Vos et a/., 
Nucl. Acids Res., 23 4407 - 4414, 1995). The obtained amplification mixtures were diluted 
600-fold and 5 \x\ was used for selective amplifications using a P 33 -labeled BstY\ primer and the 
Amplitaq-Gold polymerase (Roche Diagnostics, Brussels, Belgium). Amplification products were 
separated on 5% polyacrylamide gels using the Sequigel system (Biorad). Dried gels were 

30 exposed to Kodak Biomax films as well as scanned in a phospholmager (Amersham Pharmacia 
Biotech, Little Chalfont, UK). 

Quantitative measurements of the expression profiles and data analysis. Gel images 
were analyzed quantitatively with the AFLP-QuantarPro image analysis software (Keygene 
35 N.V., Wageningen, The Netherlands). This software was designed for accurate lane definition, 
fragment detection, and quantification of band intensities. All visible AFLP fragments were 
scored and individual band intensities were measured per lane. The obtained data were used to 
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determine the quantitative expression profile of each transcript. The raw data were corrected 
for differences in total lane intensities, after which each individual gene expression profile was 
variance-normalized . This was done as follows. 

The obtained raw data were first corrected for differences in total lane intensities which may 
5 occur due to loading errors or differences in the efficiency of PCR amplification with a given 
primer combination for one or more time points. The correction factors were calculated based on 
constant bands throughout the time course. For each primer combination, a minimum of 
10 invariable bands was selected and the intensity values were summed per lane. Each of the 
summed values was divided by the maximal summed value to give the correction factors. Finally, 

1 0 all raw values generated by QuantarPro were divided by these correction factors. 

Subsequently, each individual gene expression profile was variance-normalized by standard 
statistical approaches as used for microarray-derived data (Tavazoie et a/., Nature Genet., 22 
281 - 285, 1999). For each transcript, the mean expression value across the time course was 
subtracted from each individual data point after which the obtained value was divided by the 

15 standard deviation. A coefficient of variation (CV) was calculated by dividing the standard 
deviation by the mean. This CV was used to establish a cut-off value and all expression profiles 
with a CV less than 0.25 were considered as constitutive throughout the time course. 
The Cluster and TreeView software (Eisen et a/., PNAS, 95 14863 - 14868, 1998) was used 
for hierarchical, average linkage clustering. Quality-based clustering was done with a newly 

20 developed software program (De Smet et al., Bioinformatics 2002 May; 18(5): 735-46). This 
program is related to K-means clustering, except that the number of clusters does not need to 
be defined in advance and that the expression profiles that do not fit in any cluster are 
rejected. The minimal number of tags in a cluster and the required probability of genes 
belonging to a cluster were set to 10 and 0.95, respectively. With these parameters, 86% of all 

25 the tags were grouped in 21 distinct clusters. 

Characterization of AFLP fragments. Bands corresponding to differentially expressed 
transcripts were isolated from the gel and eluted DNA was reamplified under the same 
conditions as for selective amplification. Sequence information was obtained either by direct 

30 sequencing of the reamplified polymerase chain reaction product with the selective BstYl 
primer or after cloning the fragments in pGEM-T easy (Promega, Madison, Wl) or sequencing 
of individual clones. The obtained sequences wer§ compared against nucleotide and protein 
sequences present in the publicly available databases by BLAST sequence alignments 
(Altschul et at., Nucl. Acids Res., 25 3389 - 3402, 1997). When available, tag sequences were 

35 replaced with longer EST or isolated cDNA sequences to increase the chance of finding 
significant homology. Based on the homology, transcript tags were classified in functional 
groups as shown in Table 1 . 
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Experimental Results 

Identification and characterization of cell cycle-modulated genes 

Tobacco BY2 cells were synchronized by blocking cells in early S-phase with aphidicolin, an 
5 inhibitor of DNA polymerase a. After the inhibitor had been released, 12 time points with an 1-h 
interval were sampled, covering the cell cycle from S-phase until M-to-G1 transition. Flow 
cytometry and determination of the mitotic index showed that the majority of cells exit S-phase 
4 h after release from blocking and that the peak of mitosis is reached at 8 h. From each time 
point, extracted mRNA was subjected to cDNA-AFLP-based transcript profiling. 

10 Quantitative temporal accumulation patterns of approximately 10,000 transcript tags were 
determined and analyzed. In total, around 1,340 transcript tags were modulated significantly 
during the cell cycle. Hierarchical clustering of the expression profiles resulted in four large 
groups with the peak of expression in S-, early G2-, late G2-, or M-phase. Within each of these 
groups, several smaller clusters of genes with similar expression patterns could be 

15 distinguished. By quality-based clustering 21 different clusters were identified (see: 
http://www.plantgenetics/genomics/CCMgenes). In agreement with the hierarchical clustering, 
the four largest clusters (clusters 1 to 4 in Fig. 1) correspond to the S-, early G2-, late G2-, and 
M-phases and together contain 65% of all the tags. An additional cluster (cluster 5 in Fig. 1C), 
not clearly separated in the hierarchical clustering, includes the genes with peak expression in 

20 G1 -phase and contains another 5% of the tags. The remaining clusters are much smaller and 
most often (e.g., clusters 6, 9, 10, and 18)' include genes with a narrow temporal expression 
pattern. In addition to these clusters, three small groups of genes displaying elevated 
expression during two cell cycle phases were distinguished also by quality-based clustering 
(Fig. 1 D). 

25 After the transcript tags had been sequenced, homology searches revealed that 36.5% of the 
tags were significantly homologous to genes of known functions, 13.1% of the tags matched a 
cDNA or genomic sequence without allocated function, whereas for 50.4% of the tags no 
homology with a known sequence was found. Genes of known function belong to diverse 
functional classes (Table 1) revealing that several biological processes are at least partially 

30 under temporal transcriptional control during the cell cycle in plants. In general, the observed 
transcript accumulation profiles and cell cycle specificity correlate well with the functional 
properties of the corresponding genes. It is interesting that the number of transcription factors 
with G2-phase specificity is high, which may be related with the induction of genes involved in 
M-phase-specific processes. The overrepresentation of RNA-processing genes in the M-phase 

35 might indicate that post-transcriptional regulation is involved in gene activity during mitosis. 
Because de novo transcription is severely reduced during mitosis (Gottesfeld et a/., Trends 
Bioch. Sci., 22 197 - 202, 1997). RNA-processing could provide an alternative regulatory 
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mechanism. Intriguingly, transcript tags with homology to a gene of unknown function are 
overrepresented in the M-phase as well (Table 1). The principal differences in cell cycle events 
between plants and other organisms occur during mitosis; therefore, the inventors believe that 
several of these transcripts correspond to still uncharacterised plant-specific genes triggering 
5 these events. Remarkably, several of the tags homologous to a publicly available sequence 
have no Arabidopsis homologue, indicating that, in addition to conserved genes, different plant 
species possess also unique sets of cell cycle-modulated genes. Although many of these tags 
may be too short to significantly match with an Arabidopsis sequence, analysis of longer cDNA 
clones corresponding to a subset of tags has revealed that approximately 25% of the 

10 sequences remain novel. 

In Tables 1 to 4 a selection of 785 sequence tags are shown. This selection was based on the 
criterion if the tags were full length or that showed homology with genes known to be involved 
in the cell cycle (group 2 SEQ ID NOs 22 to 118), or on the criterion that they show homology 
with genes of unknown function (group 3 SEQ ID NOs 119 to 283) or on the criterion that the 

1 5 sequences showed no homology with the sequences in that existing databases (group 4 SEQ 
ID NOs 284-785). A first group (SEQ ID Nos 1 to 21) represent a smaller selection of tags 
which are used in the target validation method described in the present invention, more 
particularly, that were used in example 2. 

20 The core cell cycle machinery 

Several tags coincide with genes belonging to the core cell cycle machinery and exhibiting 
distinct expression profiles. Transcript tags from five B1- or B2-type cyclins as well as from a 
D2-type cyclin show mitotic accumulation and exhibit a narrow temporal expression profile, 
confirming previous studies (Mironov et a/., Plant Cell, 11 509 - 521, 1999; Sorrell et a/., Plant 

25 Physiol., 119 343 - 351, 1999). Based on the transcription patterns, the six A-type cyclins fall 
into three groups that sequentially appear during the cell cycle, adding new data to earlier 
observations (Reichheld et a/., PNAS, 93 13819 - 13824, 1996). Two groups have quite a 
broad window of transcript accumulation; one group, homologous to A3-type cyclins, is 
expressed during S-phase and disappears during G2-phase and the other group, 

30 corresponding to A2-type cyclins comes up at mid S-phase and goes down during M-phase, 
except for one transcript that is specific for S-phase. The third group, containing an A1-type 
cyclin, has the same expression pattern as the B- and D2-type cyclins. Several tags derived 
from genes encoding the plant-specific B-type cyclin-dependent kinases (CDKs) were also 
identified. CDKB1 and CDKB2 peak at the G2-to-M transition, slightly before the mitotic cyclins 

35 as describe (Porceddu et a/., J. Biol. Chem., 276 36354 - 36360, 2001). In contrast to what 
has been observed in partially synchronized alfalfa cell cultures (Magyar et a/., Plant Cell, 9 
223 - 235, 1997), the transcript levels of the tags homologous to a C-type CDK accumulate 
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differentially during the ceil cycle. The transcripts are present during late M-phase and early 
S-phase, suggesting that CDKC is active during the G1-phase. 

In addition to these well-characterized cell cycle-regulatory genes, also several tags were 
identified herein derived from genes encoding transcription factors and protein kinases or 
5 phosphatases with a known or putative role in cell cycle control. One tag with a sharp peak of 
transcript accumulation 1 h before the B- and D-type cyclins corresponds to a 3R-MYB 
transcription factor. Recently, a 3R-MYB has been shown to activate B-type cyclins and other 
genes with a so-called M-phase-specific activator domain (Ito et ai\ Plant Cell, 13 1891 - 
1905, 2001). Another tag peaking in M-phase is homologous to the CCR4 associated protein 

10 CAF. CAF forms a complex with CCR4 and DBF2, resulting in a transcriptional activator 
involved in the regulation of diverse processes including cell wall integrity, methionine 
biosynthesis and M-to-G1 transition (Liu et a/., EMBO J., 16 5289 - 5298, 1997). A majority of 
the tags with similarity to protein kinases and phosphatases show M-phase-specific 
accumulation (Table 1). Although the true identity and putative cell cycle related function 

15 remains unclear for the majority, one is highly homologous to a dual-specificity phosphatase. 
This type of phosphatases plays a crucial role in cell cycle control in yeast and animals 
(Coleman and Dunphy, Curr. Opin. Cell Biol., 6 877 - 882, 1994). Another M-phase-specific 
tag is homologous to prohibitin. In the mammalian cell cycle, prohibitin represses 
E2F-mediated transcription via interaction with retinoblastoma (Rb), thereby blocking cellular 

20 proliferation (Wang et a/., Oncogene, 18 3501 - 3510, 1999). 

Protein degradation by the ubiquitin-proteasome pathway also plays an important roleJn the 
control of cell cycle progression at both G1-to-S transition and exit from mitosis. Although there 
is little evidence for cell cycle-modulated expression of the genes encoding the various 
components of the ubiquitin-proteasome complexes, some proteins accumulate in a cell 

25 cycle-dependent way (del Pozo and Estelle, Plant Mol. Biol., 44 123 - 128, 2000). 
Furthermore, several tags were isolated herein from genes encoding ubiquitin-conjugating 
enzyme (E3), ubiquitin-protein ligase (E2), and proteasome components with an 
M-phase-specific expression pattern. Another transcript tag that accumulates during late 
M-phase is similar to cathepsin B-like proteins, which are proteolytically active and degrade 

30 diverse nuclear proteins, including Rb (Fu et a/., FEBS Lett., 421 89 - 93, 1998). 

Whereas all the core cell cycle regulatory genes have been identified that control the G2-to-M 
transition for which the expression is known to be cell cycle modulated, genes such as Rb and 
E2F, controlling G1-to-S transition were not found. These genes were probably missed 
because the G1-to-S transition was not included, in the present analysis, what is supported by 

35 the finding that the early targets of E2F, such as polymerase a and ribonucleotide reductase, 
are already present at high levels at the beginning of the time course. 
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Genes involved in DNA replication and modification 

In agreement with the studies performed in yeast and human fibroblasts, transcripts encoding 
proteins involved in DNA replication and modification accumulated during S-phase and 
exhibited broad temporal expression profiles. Different replication factors, DNA polymerase a, 
5 and the histones H3 and H4 are already present at the onset of the time course, indicating that 
they are induced before the time point of the aphidicolin arrest. Interestingly, most of the 
histones H1, H2A, and H2B appear somewhat later than H3 and H4, what might reflect that 
they are deposited into the nucleosomes after H3 and H4 (Luger et a/., Nature, 389 251- 260, 
1997; Tyler et a/., Nature, 402 555 - 560, 1999). The profile of the homologue of the 

10 anti-silencing function 1 (ASF1) protein is similar to that of the histones H3 and H4, in 
agreement with the fact that the three proteins are part of the replication-coupling assembly 
factor complex that mediates chromatin assembly (Tyler et al., Nature, 402 555 - 560, 1999). 
Genes encoding high-mobility group proteins reach the highest accumulation during late G2, 
consistent with the subsequent steps involved in the folding and structuring of the chromatin. 

15 Tags derived from genes encoding proteins involved in DNA modification, such as 
S-adenosyl-L-methionine (SAM) synthase and cytosine-5-methyl- transferase are found in the 
histone cluster. Tags from methionine synthase genes, which provide the precursor for SAM 
synthase, accumulate during M-phase, in contrast to yeast, where these genes are expressed 
during late S-phase (Spellman et a/., Mol. Cell Biol., 9 3273 - 3297, 1998). 

20 Genes involved in chromatin remodelling and transcriptional activation or repression have 
been identified as well. One gene is a histone deacetylase with highest transcript accumulation 
during the G2-phase and another belongs to the SNF2 family of chromodomain proteins with 
an M-phase-specific expression pattern. Interestingly, one tag corresponds to a mammalian 
inhibitor of growth 1 (p33-ING1) protein. The human ING1 protein has DNA-binding activity 

25 and might be involved in chromatin-mediated transcriptional regulation (Cheung and Li, Exp. 
Cell Res., 268 1 - 6, 2001). This protein accumulates during S-phase (Garkavtsev and 
Riabowel, Mol. Cell Biol., 17 2014 - 2019, 1997), what is in agreement with the expression 
profile we observed. The yeast homologues of ING1 are components of the histone 
acetyltransferase complex and show similarity to the Rb-binding protein 2 (Loewith et a/., Mol. 

30 Cell Biol., 20 3807 - 3816, 2000). Another tag, homologous to the Arabidopsis MSI3 protein, 
follows a similar expression profile. MSI-like proteins are involved in the regulation of histone 
acetylation and deacetylation and in chromatin formation (Ach et ai, Plant Cell, 9 1595 - 1606, 
1997). 

The expression profiles of the different ribonucleotide reductase (RNR) genes are more 
35 complex. One gene is already expressed at high levels at the beginning of the time course and 
its expression is restricted to the S-phase as described (Chaboute et a/., Plant Mol. Biol., 38 
797 - 806, 1998), whereas, in contrast, another one is highly expressed in S-phase and 
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reappears at lower levels during M-phase and a third one is M-phase-specific. This latter 
expression profile has also been described for a RNR gene from Xenopus where the encoded 
protein appears to be involved in microtubulin nucleation (Takada et a/., Mol. Cell Biol., 11 
4173-4187, 2000). 

5 Numerous other transcript tags with S-phase specificity were found in addition to the ones 
involved in DNA replication and modification. Most interestingly, one of these tags is 
homologous to a mammalian gene encoding a TRAF-interacting protein (TRIP), which is a 
component of the tumor necrosis factor (TNF) signalling complex, and promotes cell death 
when complexed with TRAF (Lee et a/., J. Exp. Medicine, 185 1275 - 1285, 1997). Another 
10 S-phase-specific tag shows homology to the RING finger domain of inhibitor of apoptosis 
proteins, which are also involved in the TNF signalling pathway. 

Modulated expression of genes required for mitosis and cytokinesis 

Several paralogous genes that encode either a- or p-tubulin were highly induced and 
15 accumulated prior to the mitotic index peak or during early M-phase. The inventors found that 
in BY2, tubulin genes are highly cell cycle modulated. This transcriptional regulation is in 
agreement with previous demonstrations of de novo transcription of a- and p-tubulin genes 
during different cellular processes (Stotz et a/., Plant Mol. Biol., 41 601 - 614, 1999). In the 
present analysis, no y-tubulin genes were found, confirming published data that the amount of 
20 y-tubulin is constant in dividing BY2 cells (Stoppin-Mellet et a/., Plant Biol., 2 290 - 296, 2000). 
Most of the kinesins identified herein, fall in the same cluster as the tubulins peaking prior to 
mitosis. Interestingly, two tags have a distinct transcription pattern and appear in another gene 
cluster. Their window of transcript accumulation is very narrow and coincides with the peak of 
mitosis. Most interestingly, these tags correspond to the plant-specific 
25 phragmoplast-associated type of kinesin, PAKRP1 (Lee and Liu, Curr. Biol., 10 797 - 800, 
2000). A chromokinesin not yet described in plants was identified as well. This type of motor 
proteins use DNA as cargo and play a role in chromosome segregation and metaphase 
alignment (Wang et a/., J. Cell Biol., 128 761 - 768, 1995). 

Among the M-phase-specific kinases, two were unambiguously recognized herein as playing a 
30 role in cytokinesis. One is Aurora, a protein kinase with a key role in the control of 
chromosome segregation, centrosome separation, and cytokinesis in yeast and animals 
(Bischoff and Plowman, Trends Cell Biol., 9 454 - 459, 1999) but not described in plants yet. 
The other is NRK1, a mitogen-activated protein kinase kinase which is phosphorylated by 
NPK1, a kinase involved in regulating the outward redistribution of phragmoplast microtubules 
35 (Nishihama et a/., Genes Dev., 15 352 - 363, 2001). 
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Hormonal regulation and cell cycle-modulated gene expression 

A number of genes belonging to the class of auxin-induced genes were also differentially 
expressed. Cell cycle-modulated expression of auxin-induced genes has never been observed 
before although auxins together with cytokinins are the two major groups of plant hormones 
5 that affect cell division (Stals and Inze, Trends Plant Sci., 6 359 - 364, 2001). The genes as 
identified herein fall into two groups based on their transcript accumulation profiles (data not 
shown). The first group displays an early S-phase-specific expression pattern and consists of 
the parA, parB and parC genes. Induction of the par genes is most often observed in response 
to stress conditions (Abel & Theologis, Plant Phys. 111, 9 - 17, 1996). The fact that the 
1 0 transcripts rapidly disappear after release from the cell cycle-blocking agent might indicate a 
stress response rather than a cell cycle dependent auxin response. 

More interesting is the second group of genes with transcripts accumulating during early 
M-phase. This group includes the auxin response factor 1 (ARF1), an auxin transporter as well 
as different members of the early auxin response AUX/IAA gene family. ARF1 is a transcription 

15 factor that binds to a particular auxin response element (Ulmasov et aL, Science, 276 1865 - 
1868, 1997). Additional studies suggest that the activity of ARF1 is controlled by its 
dimerization with members of the AUX1/IAA family (Walker and Estelle, Curr. Opin. Plant boil., 
1 434 - 439,1998). The similarity in temporal expression profiles the inventors observed 
supports these findings and suggests that these proteins mediate an auxin response 

20 necessary for cell cycle progression 

By using tobacco BY2 as model system together with cDNA-AFLP-based transcript profiling, it 
is described herein for the first time how a comprehensive inventory of plant cell 
cycle-modulated genes can be made. Although the obtained data confirm earlier results and 
observations, in addition, numerous novel findings were made. The obtained data are a very 

25 useful basis for selecting and validating agrochemical target genes. 

Example 2 

In this example it is described how plant genes are evaluated for assessment of their essential 
character in the biological process, thus how they are validated as good candidate targets for 
30 agrochemicals. 

The Tobacco Rattle Virus (TVR) is used to induce silencing of target genes . In case of an 
essential gene the simlencing will result in a lethal effect on the plant and therefore, the 
suystem allows to validate good candidates as targets for herbicides . 

The TRV based system is used in this example in combination with series of candidate genes, 
35 more particularly with the candidate targets as represented herein as group 1 sequences 
consisting of the SEQ ID NOs 1 to 21. The identification technique of the present invention 
(see example 1) allowed to identify new genes that are potential new herbicide targets, 
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because of their putative function in various key processes crucial for cell life, their expression 
at a certain developmental stage crucial for cell life, their role in metabolism and/or 
maintenance of cell living state. 

This example illustrates the validation of these candidate genes as novel targets for 
5 agrochemicals, via the technique of the virus-induced gene silencing (VIGS). 

Gene silencing mechanism 

The virus-induced gene silencing (VIGS) is a manifestation of an RNA-mediated defence 
mechanism that is related to post-transcriptional gene silencing (PTGS) in transgenic plants 

10 (Ratcliff et a/., Plant J., 25 237 - 245, 2001). The method uses a vector with an infectious 
cDNA of tobacco rattle virus (TRV) modified (see below) to facilitate insertion of target 
sequences and modified for efficient infection of plants (e. g. tobacco). The vector mediates 
VIGS of endogenous genes in the absence of specific virus-induced symptoms. 
The RNA-mediated defence is triggered by the virus vectors, and targets both the viral genome 

15 and the host gene corresponding to the insert. As a result, the symptoms in the infected plant 
are similar to loss-of-function mutants or reduced-expression mutants in the host gene. The 
presence of a negative growth phenotype suggests that the targeted gene is a potential 
herbicide target. 

The process of constructing a virus vector and monitoring symptoms on infected plants is 
20 completed within a few weeks, such that virus-induced gene silencing (VIGS) provides a 
simple, rapid means of assigning function to genes that have been sequenced but are 
otherwise uncharacterized. The determination of new herbicide target genes is performed in a 
few weeks including gene cloning, transformation steps and tobacco plant analyses. 
The TRV construct is shown to target host RNAs in the growing points of plants (Ratcliff et a/ M 
25 Plant J., 25 237 - 245, 2001) such as meristems and actively dividing cells. 

It has been shown that this vector overcomes many of the problem features of PVX, TMV and 
TGMV. For example, the TRV vector induces very mild symptoms, infects large areas of 
adjacent cells and silences gene expression in growing points such as meristems and actively 
dividing cells. Infection of tobacco plants on the leaves with TRV based constructs will affect 
30 growth and development of upper parts of the infected leaves and allow screening for growth 
parameters. 

Construction of TRV vectors used in the validation process of the present invention 

TRV is a positive-strand RNA virus with a bipartite genome. Proteins encoded by RNA 1 are 
35 sufficient for replication and movement within the host plant, while proteins encoded by RNA 2 
allow virion formation and nematode-mediated transmission between plants (reviewed by 
MacFarlane, J. Gen. Virol., 80 2799 - 2807,1999). 
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The downregulation system is composed of separate cDNA clones of TRV RNA 1 and RNA 2 
under the control of cauliflower mosaic virus (CaMV) 35S promoters on the transferred T-DNA 
of plant binary transformation vectors. 

The TRV RNA 1 construct (pBINTRA6) contains a full-length infectious cDNA clone in which 
5 the RNA polymerase ORF is interrupted by intron 3 of the Arabidopsis Col-0 nitrate reductase 
NIA1 gene (Wilkinson and Crawford, Mol. Gen. Genet., 239 289 - 297, 1993), necessary to 
prevent expression of a TRV-encoded protein that is toxic to E. coli. This vector has been 
given the internal reference number p3209. 

The TRV RNA 2 construct (pTVOO), contains a multiple cloning site (MCS), leaving only the 5' 
10 and 3' untranslated regions and the viral coat protein (Ratcliff ef a/., Plant Cell, 11 1207 - 
1215, 1999). This vector has the internal reference number p3930 and contains a Gateway™ 
cassette and the gene of interest to be tested. The genes as presented in SEQ ID NO 1 to 21 
are each cloned in this vector. 

cDNAs were amplified using Gateway compatible primers and the cDNAs were entered into 
15 Entry Clones by BP recombination reactions. Subsequently the entry clones comprising the 
gene according to any one of SEQ ID NO 1 to 21 were checked via Ban2 restiction digest. The 
genes of interest were then entered into destination vectors by LR recombination reactions and 
the destination vectors were checked via ECORV restriction digestions. These expression 
clones were electroporated into the Argobacterium strain GV3101 agro and the plasmid 
20 pBintra6 was electroporated into pMP90 agro. 

Inoculation 

To inoculate plants, Agrobacterium cultures carrying pBINTRA6 (strain C58C1RifR containing 
pMP90 plasmid) and pTVOO (strain GV3101 containing pMP90 plasmid) were grown and 

25 mixed and infiltrated to the leaves of Nicotians benthamiana as previously described (English 
et at., Plant J., 12 597 - 603, 1997). Briefly, virus infection was achieved by Agrobacterium- 
mediated transient gene expression. Agrobacterium containing the TRV cloning vectors were 
grown overnight in L brith (+Tc+Km), Agrobacterium containing the helper plasmid was grown 
overnight in 10 ml YEB+Rif+Km. The culture was centrifuged and resuspended in 10 ml of 

30 10mM MgCI 2 , 1mM MES-pH5.6 and 100jjM acetosyringone and kept at room temperature for 
2 h. Separate cultures containing pBINTRA6 and TRV cloning vectors were mixed in a ratio of 
1:10. The culture was then infiltrated to the underside of two leaves of three-weeks old plants 
using a 2 ml syringe without a needle. In two independent experiments 6 plants per 
agroabcterium clone were infected. In this way the cloned genes (SEQ ID NO 1-21) were 

35 transferred into the cells of the infiltrated region, and could be transcribed into the viral cDNAs 
in the leave cells. These transcripts then serve as an inoculum to initiate systemic infection of 
the plant. Consequently the VIGS system is activated, resulting in the downregulation of the 
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host cell gene, corresponding to the cloned gene of interest. All experiments involving virus- 
infected material was carried out in controlled growth chambers. N. benthamiana plants were 
germinated ad grown individually on universal potting ground in pots at 25°C during the day 
(16h) and 20°C during the night (8h). 

5 

The plants were phenotypically evaluated on a daily basis. Particular attention was given to 
visible leaf damage and growth inhibition. The effects of the suppression of gene activity using 
the VIGS system is measured by the phenotypic aspect of the plants, including leaf defects 
such as growth retardation, yellow or necrotic spots, early senescence, etc. The effects of the 
10 downregulation of genes identified by the methods of the invention are also measured on the 
flower structure and the flowering capacities of the transformed plants. 

The severity of the phenotype is linked to the level of suppression of the geneactivity and 
indicates the degree in which the gene is essential for the plant Therefor the phenotype is an 
indication of the degree in which the gene is a valid target for a herbicide. 

15 

Phenotypes of the infected plants. 

1 . Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf growth modification, including leaf wrinkling, curling, wilting, 
leading to cell death and/or plant death. 

20 

2. Co-suppression of the geneleads to loss of gene transcription and protein expression in the 
virus infected leaf and induces leaf yellowing or senescence, or cell degth and necrosis, 
leading to plant death. 

25 3. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following phenotypic symptoms: chlorotic regions 
around infection, crisp or crunchy leaf texture around infection, numerous surface lumps on 
either leaf surface, abnormal trichomes, abnormal leaf size, reduced growth, reduced final 
size, altered vascular leaf system, altered water movement in leaf , leading to cell death and/or 

30 plant death. 

4. Co-suppression of the gene leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following anatomical symptoms: clumps of modified 
cells on the surface of the leaf (either abaxial or adaxial), individual cells detached from the 
35 epidermis, swollen or modified trichome cells, modification of leaf tissue structure, cell size, cell 
number, tissue composition, parenchyme, epidermis, etc , leading to cell death and/or plant 
death. 
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5. co-suppression of gene X leads to loss of gene transcription and protein expression in the 
virus infected leaf and induces any of the following biochemical symptoms, enzyme activity 
and products, degradation of leaf components and effects in neighboring leaves, stem, 
5 vascular system, .degradation of cell wall structure, communication between cells, modification 
of cell-cell signaling leading to cell death and/or plant death. 

The genes identified by the present invention can be utilized to examine herbicide tolerance 
mechanisms in a variety of plants cells, including gymnosperms, monocots and dicots. It is 
particularly useful in crop plant cells such as rice, corn, wheat, barley, rye, sugar beet, etc 

10 

Example 3 

Significant phenotypic alterations could be observed in plants infiltrated with Agrobacterium 
containing pBINTRA6 + Bstt44-4-340 (SEQ ID NO 18, acetolactate synthetase) and 
pBINTRA6 + Bstt2-42-520 (or T4-32-7) (SEQ ID NO 21, prohibitin) and pBINTRA6 + Bstt23-4- 
1 5 230 (SEQ ID NO 1 1 , B-type CDK). 

At 10days post-infiltration the first symptoms were visible. The symptoms were persistent until 
the end of the experiment and could be observed in at least 5 out of the 6 infiltrated plants. 

The phenotypes of the plants transformed with acetolactate synthase are further described. 

20 In two separate replicated experiments, specific phenotypes on each plant infected with the 
acetolactate synthetase downregulation construct were observed (Figure 2). Winkling and 
wrapping of the leaves as well as some chlorotic spots were observed. Thus acetolactate 
downregulation provoked a general growth arrest accompanied with chlorotic and necrotic 
areas. These observations were in line with previous reports, wherein acetolactate synthetase 

25 is described as a useful herbicide target. 

The phenotypes of the plants transformed with prohibitin are further described. 
In two separate replicated experiments, specific phenotypes on each plant infected with the 
prohibitin downregulation construct were observed (Figure 2). These plants showed strong 
30 wrinkling of the leaves about 20 days after infection, corresponding to the expected occurrence 
of silencing events. Thus the downregulation of probibitin provokes a severe leaf distortion and 
general growth arrest. 

The phenotype of the plants inoculated with a B-type CDK downregulation construct are shown 
35 in Figure 3. A late (from 30 days after inoculation) but strong negative effect on the plant 
growth was observed. The plants started to grow much slower and lost their apical dominance, 
resulting in the increased appearance of lateral branches. 
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Table 1. Functional classification of transcript tags 





Function 


Tags 


s 


G2 


M 


G1 


\j 






97 7% 
f . / /o 


15 8% 

1 *J . \J /U 


52 9% 


3 6 




Cell cycle control 


30 


5/8 (0.078) 


8/5 (0.068) 


14/1o (U.1 14) 


3/1 




Cell wall 


35 


6/10 (0.047) 


4/6 (0. 1 36) 


25/18 (7.1e ) 


0/1 




Cytoskeieton 


43 


1/12 (1.2e ) 


Ail (0.090) 


38/22 (2-1 e ) 


0/2 


10 


Hormone response 


13 


6/4 (0.113) 


1/2 (0.277) 


6/7 (0.185) 


0/0 




Kinases/phosphatases 1 


27 


4/8 (0.039) 


1/4 (0.059) 


19/14 (0.025) 


3/1 




Protein synthesis 


50 


15/14 (0.116) 


5/8 (0.087) 


29/26 (0.079) 


1/2 




Proteolysis 


21 


2/6 (0.026) 


1/3 (0.144) 


17/11 (0.039) 


1/1 




Replication and modification 


74 


57/20 (4.2e 19 ) 


8/12 (1.0e" s ) 


8/39 (1.0e' 18 ) 


1/3 


15 


RNA processing 


20 


1/6 (6.8e-3) 


1/3 (0.137) 


18/11 (8.1 e -4 ) 


0/0 




Signal transduction- 


10 


1/3(0.121) 


3/2(0.201) 


6/5 (0.205) 


0/0 




Stress response 


20 


6/6(0.192) 


2/3 (0.229) 


10/10(0.159) 


2/1 




Transcription factors 


27 


4/8 (0.039) 


10/4 (3.0e- 3 ) 


12/14(0.112) 


1/1 




Transport and secretion 2 


31 


5/9 (0.047) 


2/5 (0.076) 


21/16(0.031) 


3/1 


20 


Unknown 


175 


37/48 (0.015) 


19/28 (0.014) 


112/93 (S^e -4 ) 


7/6 



The total number of tags and the observed/expected number of tags within the different cell 
cycle phases for each functional group is given together with the probability values between 
parentheses as calculated based on the binomial distribution function, except for the G1 -phase 
25 because the values were too small. A significant enrichment (P<e -3 ) of tags of a functional 
group within a particular cell cycle phase is indicated in bold. 

1 Only kinases and phosphatases with unknown biological function. 

2 Except small GTP-binding proteins, which are classified under signal transduction. 



30 T able 2: overview of group 1 of sequences used for validation of candidate target gene s 



SEQ ID NO 


CDS NO 


tag Name 


Function 


Fase 


1 


2216 


18R1850 C4-32-33 1E2 


catalase 


?? 


2 


2217 


Bstt2-31-215 


phytoene desasturase 


?? 


3 


2218 


Bstc13-1-145 


L-ascorbate peroxidase 


M-G1 


4 


£219 


Bstc21 -4-280 


GTP-bindingprotein 


M 


5 


£220 


Bstc33-2-310 


Vacuolarsortingreceptor 


M 


6 


2221 


Bstc4-34-170 


probable cinnamyl alcohol dehydrogenase 


G1/S-S; M-G1 


7 


2222 


Bstt34-3-470 


kinesin 


M 


8 


2223 


Bsttl 2-3-410 


B-typeCDK 


M 


9 


2224 


Bsttl 4-3-458 


squalene mono-oxyqenase 


G1/S-S 


10 


2225 


Bsttl 2-1-230 


kinesin-likeprotein 


M j 


11 


2226 


Bstt23-4-230 


B-typeCDK 


M 


12 


2227 


Bstt2-42-225 


B-typeCDK 


M 


13 


2228 


Bstt31 -4-208 


arabinogalactan protein precursor 


G2/M-M I 


14 


2229 


Bstt 3-41-205 


arabinogalactan protein precursor 


G2/M-M ~J 
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1 




Bstt33-4-285 


chorismate synthase 


S-G2 J 


1ft 




Bs ti2-3 1-2 15 


ktnesin-Iikeprotein 


M 


17 


2232 


Bstt4 1-2-400 




Jv) 


18 


2233 


Bstt44-4-340 


acetolactate synthase 


G2/S-G2-M-G1 


19 


£234 


G1 7-2-1 3 G1 7-2-1 3 


WRKY transcription factor 


?? 


20 


2235 


mapk9-ntf6.seq 


mapkinase phragmoplast associated NTF6 ' 


? ? 


21 


2236 


Bstt2-42-520 


prohibiten 


?? 



Table 3: overview of group 2 sequences of full-length sequences that are cell cycle 



SEC 

ID 

NO 


*CDS 
NO 


Gene name 


22 


0613 


Protein kinase mRNA, complete , N. tabacum, 2073 bp 


23 


0614 


dy^ AALKimud proDaoie UNA-binding protein GBP16 - rice T02069, N. tabacum 834 bp 


24 


0615 


DizMAu^uuy proDaoie nuclear DNA-binding protein G2p [imported] in Arabidopsis T51 151 N. 

lauauuili, l 1 Ouup 


25 


0616 


□ r^MMu^j 1 1 iranscnpnon regulator-like in Arabidopsis AB025604, N. tabacum, 1893bp 


26 


0617 


BY2 AA044J23 ATP-dependent RNA helicase CA3 of the DEAD/DEAH box family; Dbp3p* BY2- 
rvrAu*t*+OiCor' iauu i r\iNM nencase kho in AraDidopsis 151 739 N. tabacum 1593bp 


27 


0618 


DU nMU4Du id protein pnospnatase ^G-like in Arabidopsis BAB08417 AB025622, N. tabacum 732bo 


28 


0619 


d i /i / o io i^t-o-o-jiKe protein L/ ryoo4o, n. tabacum, 70bp 


29 


0620 


d t c-MMUD'fLuy proiem Kinase tousled in Arabidopsis A49318 N. tabacum 2037bp 


30 


0621 


BY2-AA066H1 1 P19H05 phosphoprotein phosphatase 2A regulatory chain T03684 N. tabacum 1764 bp 


31 


Oft?? 


BY2-AA069L10 transcription factor-like protein in Arabidopsis BAB09482 AB012246, N tabacum 831 dd 


32 


Oft?? 


d y ^-mau ( *5 i\ud i protein, phospatase 2A inhibitor in Arabidopsis AAG52377.1 AC01 1765, N 
tabacum 


33 


0R94 


t5Y^-AAU/csiviKiyt5U/ phosphoprotein phosphatase 2A regulatory chain T03684, N. tabacum, 1764bo 


34 




t5Y^-AAU/bHl2 Putative phospatase 2A inhibitor in Arabidopsis AC011809 9 AC011809 N. tabacum 


35 


0626 


BY2-AA076O02P19B08 hypothetical protein kinase in Arabidopsis T47727, N. tabacum 2514 bp 


36 


0627 


BY^-MMu/yj u putative casein kinase I in Arabidopsis AAG51 841.1 AC010926 4 , N tabacum 1401 bo 


37 


0628 


d i ^-Mauouu porm i ^t)i\ in potato v>4b959, N. tabacum, 393bp 


38 


0629 


BY2- AA081P13p21E02 separation anxiety protein-like in Arabidopsis CAB96669.1 AL360314, N. 

laUCtLriJIli, T'yAiJIJ 


39 


0630 


v^ornpiementary copy ot uoou, N. tabacum, 975bp 


40 


0631 


BY2-AA085N17p21H04 14-3-3-hke protein in potato 16R P93784 N. tabacum 768bp 


41 




DY^-MAua/uibp^iuu3 AP2 domain transcription factor homoioq in potato T07784 N. tabacum 891bD 


49 


L/OOO 


BY2-AAU8 8fc31 3 putative RING zinc finger protein in Arabidopsis CAB80936.1 AL161491 N tabacum * 
1248bp 






t3Yz- AAuy^iviua protein kinase homolog in Arabidopsis T02181 N. tabacum858 




JUOJ 


d y ^-AAuybiviu ( peptidyi-prolyl cis-trans Isomerase-like protein BAB10691.1 AB015468 N. tabacum 
450bp 


4^ 
to 


JOOD 


d YZ-AAuybM i z zinc tinger protein-like in Arabidopsis BAB09106.1 AB017069 N. tabacum 1518bp 


46 


3637 


BY2-AA096M22 cell division-like protein in Arabidopsis T45963 N. tabacum 687bp 


4"7 


JOOo__l 


bY^-AAUybbUbp21U11 similarity to DAG protein in Arabidopsis BAA97063.1 AP000370 N. tabacum 
1 146bp 


48 ( 


3638 2 


Id AA091G16p21F05 N. tabacum 891 bp 




icon 


bY2-AAluyNl5 C3AMM1 protein-like in Arabidopsis BAB08430.1 AB017067 N. tabacum 888bp, (MYG1) 
-AMILY, proliferation associated 


50 ( 


3640 


Complementary copy of 0640 N. tabacum, 891 bp 


51 ( 


3641 


BY2-AA1 14N16 unknown protein in Arabidopsis BAB03019.1 AP001297; candidate tumor suppressor 
p33 ING1 homolog in Homo sapiens N. tabacum 720bp 


52 ( 


3642 


BY2-AA115P21p22D02 NAC2 Arabidopsis AAF09254.1 AF201456 1N. tabacum 699bp 


53 ( 


3643 


BY2-AA119N11p22G04 serine/threonine-specific protein kinase-like protein BAB09338 1 AB016879 N 
abacum 1293bp 


54 ( 


3662 


3Y2-AA041 E04 >pir)|T06678 hypothetical protein T1 7F1 5.80 - Arabidopsis thaliana 


55 ( 


3663 


3Y2-AA043A01 >gb|AAD24540.1 |AF1 13545^1 (AF1 1 3545) vacuofe-associated annexin VCaB42 
Nicotian a tabacum] 


56 [ 


3664 

i 


3Y2-AA044C02 >dbj|BAA02028.1| (D11470) chloroplast elongation factor TuB(EF-TuB) [Nicotiana 
abacum] 


57 ( 


)665 I 


3Y2-AA044L14 dbj|BAA97319.1| (AB020754) qene id:MYN8.3-pirllT02891 -similar to unknown protein 


58 C 


)666 I 
I 


3Y2-AA045P04pO1G1O sp|Q43681|NLTP_VIGUN PROBABLE NONSPECIFIC LIPID-TRANSFER 
PROTEIN AKCS9 


59 C 


)667 [ 


3Y2-AA046C08p19E02 db||BAB30364.1 1 (AK01 66593 putative TMus musculus] 



29 



BNSDOCID: <WO_ 



030851 15A2_L> 



WO 03/0851 15 PCT/EP03/03703 



60 


0668 


BY2-AA046E06 pir||T50556 stamina pistilloidia protein Stp [imported! - garden pea 


61 


0669 


BY2-AA046G14 dbj|BAB26082.1 1 (AK009117) putative fMus musculus] 


62 


0670 


BY2-AA046H23 emb|CAA98172.1| (Z73944) RAB8A [Lotus japonicus] 


63 


0671 


BY2AA048A05 gb]AAD1 5504.1) (AC006439) putativeAAA-type ATPase [Arabidopsis thaliana] 


64 


0672 


BY2-AA049K03 dbj|BAB24909.1 1 (AK007240) putative [Mus musculus] 1 


65 


0673 


BY2-AA051A10 dbj|BAB02543.1 1 (AP000417) mitotic checkpoint protein [Arabidopsis thalianal 


66 


0674 


BY2-AA051L22p19H03 gb|AAD48948.1|AF147262_1 1 (AF147262) contains similarity to Pfam family 
PF00400 -WD domain 


67 


0675 


BY2-AA052E10 >gb|AAF52905.1| (AE003628) CG4968 gene product [Drosophila melanogaster] 


68 


0676 


BY2-AA052F14 >gb|AAF79819.1|AC007396_20 (AC007396) T4012.22 [Arabidopsis thalianal 


69 


0677 


BY2-AA052G16p19D04 >dbj|BAB09843.1| (AB005246) geneJd:MUP24.12~unknown protein 
Arabidopsis thaliana] 


70 


0678 


BY2-AA052N17 >gb|AAG42914.1|AF327533 1 (AF327533) unknown protein [Arabidopsis thalianal 


71 


0679 1 


BY2-AA053C11.1 >dbj|BAB22857.1| (AK003561) putative [Mus musculus] 


72 


0679 


2 BY2-AA053C11.2 >gb|AAC62883.1 1 (AC005397) hypothetical protein [Arabidopsis thaliana] 


73 


0680 


BY2-AA062A09>gb|AAF01061.1|AF189284_1 (AF189284) nucleolar G-protein NOG1 [Trypanosoma 
brucei] 


74 


0681 


BY2-AA062G03 >pir||T02135 hypothetical protein F8K4.10 - Arabidopsis thaliana 


75 


0682 


BY2-AA065E08 >pir||T00795 hypothetical protein F24L7.13 - Arabidopsis thaliana 


76 


0683 


BY2-AA072K18 >emb|CAB40381.1| (AJ010819) GrpE protein [Arabidopsis thaliana] 


77 


0684 


BY2-AA075K12>gb|AAD31331.1|AC007354 4 (AC007354) T16B5.4 [Arabidopsis thaliana] 


78 


0685 


BY2-AA076N08 >dbj|BAA94770.1| (AP001859) ESTs AU082761(S5084) D42006 


79 


0686 


BY2-AA080D01 >gb|AAF80646.1|AC012190_2 (AC012190) Contains similarity to F28016.19 a putative 
translation initiation protein 


80 


0687 


BY2-AA081P14 >gb|AAD32777.1|AC007661 14 (AC007661) unknown protein [Arabidopsis thaliana 


81 


0688 


BY2-AA082H04p21F02 >dbj|BAB10171.1| (AB016880) geneJd:MTG10.12~pir||T05795-~strong similarity 
to unknown 


82 


0689 


BY2-AA082H06p21 G04 >pir||T09039 hypothetical protein F26K1 0.1 1 0 - Arabidopsis thaliana 


83 


0690 


BY2-AA082M07p21B05 >dbj|BAB01 783.1 1 (AB022215) gene Jd :MCB1 7.1 9~un known protein 
Arabidopsis thaliana] 


84 


0691 


BY2-AA083B24p21C04 >dbj|BAB08247.1| (AB006698) gene_id:MCL19.6~unknown protein [Arabidopsis 
thanliana) 


85 


0692 


BY2-AA083C05p21D02 >gb|AAH02924.1|AAH02924 (BC002924) Unknown (protein for 
IMAGE:3956179) [Homo sapiens] 


86 


0693 


BY2-AA085D08p21C05 >pir|lT47624 hypothetical protein T5N23.10 - Arabidopsis thaliana 


87 


0694 


BY2-AA085F09p21H01 >qb|AAF79503.1|AC002328 11 (AC002328) F20N2. 15 [Arabidopsis thaliana] 


88 


0695 


BY2-AA085M15p21D04 >gb|AAF97305.1|AC007843_8 (AC007843) Unknown protein [Arabidopsis 
thaliana] 


89 


0696 


BY2-AA088K23p21G05 >gb|AAG52001 .1 |AC012563_1 1 (AC012563) unknown protein; 64612-65506 
[Arabidopsis thaliana] 


90 


0697 


BY2-AA088L24p21A07 >gb|AAD55292.1|AC008263_23 (AC008263) Contains PF[00249 Myb-like DNA- 
binding domain. 


91 


0698 


BY2-AA089F12p21H05 >gb)AAD55274.1|AC008263_5 (AC008263) Strong similarity to gb|D21805 
calcium-dependent protein kinase 


92 


0699 


BY2-AA089M17 >pir||T02186 hypothetical protein F14M4.16 - Arabidopsis thaliana 


93 


0700 


BY2-AA090J23p21G08 >pir||T48545 hypothetical protein F14F1 8.30 - Arabidopsis thaliana 


94 


0701 


BY2-AA092F12p21H06 >emb|CAB46854.1 1 (AJ388555) hypothetical protein [Cam's familiaris] 


95 


0702 


BY2-AA092L20p21 E07 >gb|AAD10646.1| (AC005223) 45643 [Arabidopsis thaliana] 


96 


0703 


BY2-AA093J23p21C11 >g b| AAG5 1 46 1.1| AC 069160.7 (AC069160) unknown protein [Arabidopsis 
thaliana] 


97 


0704 


BY2-AA093L18p21D09 >emb|CAC1 5504.1 1 (AJ297917) B2-type cyclin dependent kinase [Lycopersicon 


98 


0705 


BY2-AA093M19 >qb|AAG1 2535.1 |AC01 5446 16 (AC015446) Unknown protein [Arabidopsis thaliana] 


99 


0706 


BY2-AA094B12p21F10 >dbj|BAB02118.1| (AP000381) contains similarity to unknown 


100 


0707 1 


BY2-AA096G05p21A11 dbj|BAB02118.1l (AP000381) contains similarity to unknown 


101 


0707 2 


Id AA094B12p21F10 


102 


0708 


BY2-AA097G22p21D10 >gb|AAG60065.1|AF337913_1 (AF337913) unknown protein [Arabidopsis 
thaliana 


103 


0709 


BY2-AA099F04 gb|AAG52457.1 |AC01 0852^1 4 (AC010852) hypothetical protein; 12785-11538 
[Arabidopsis thaliana] 


104 


0710 


BY2-AA099N08p21 H09 qb|AAK1441 1 .1 (AC087851 3 (AC087851) unknown protein Prvza sativa] 


105 


0711 


Id AA100B09 ref|NP 009820.1 1 Ybr261cp rSaccharomyces cerevisiae] 


106 


0712 


BY2-AA109N02 reflNP_002848.1| peroxisomal farnesylated protein; Housekeeping gene 33kD [Homo 
sapiens 


107 


0713 


BY2-AA1 14E09p22F02 pir||T51434 hypothetical protein F2G14 10 - Arabidopsis thaliana 


108 


0714 


BY2-AA115B14p22C02 dbj|BAB08888.1| (AB0 12243) geneJd:M!J24.6~ref|NP_01 3897.1 -similar to 
unknown protein 
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109 


0715 


BY2-AA1 15F08p22C04 >gb|BY2-AAH03900.1 |AAH03900 (BC003900) Similar to hypothetical protein 
384D8^6 [Mus musculus] 


110 


0716 


BY2-AA115L12p22G01 >gb|AAF43925.1|AC012188_2 (AC012188) Contains similarity to PIT1 from 
Arabidopsis thaliana 


111 


0717 


BY2-AA116L23p22E01 >dbj|BAB01460.1| (AP000731) gene Jd:MCB 17.21 -unknown protein 
[Arabidopsis thaliana] 


112 


0718 


BY2-AA117B12p21G12 >sp|O23708|PSA2 ARATH PROTEASOME SUBUNIT ALPHA TYPE 2 (20S 
PROTEASOME ALPHA SUBUNIT B) 


113 


0719 


BY2-AA117E08p22A03 >pir||F81195 conserved hypothetical protein NMB0465 [imported] - Neisseria 


114 


0720 


BY2-AA117O08p22E03 >dbj|BAB01 753.1| (AP000603) qb|BY2-AAD1 0646.1 -gene id:MRP15.12 


115 


0721 


BY2-AA118D23p22E02 >emb|CAB89490.1| (AJ277062) CRK1 protein [Beta vulgaris], cdc2 like kinase 


1 1o 


0722 


BY2-AA119D12p22H04 >dbj|BAB01 163.1 1 (AP000410) geneJd:K10D20.9~unknown protein 
[Arabidopsis thaliana] 


117 


0723 


BY2-AA120G12 >gb|BY2-AAB63649.1! (AC001645) hypothetical protein [Arabidopsis thaliana] 


118 


0724 


BY2-AA120G19p22D05 >gb|BY2-AAF69547.1|AC008007_22 (AC008007) F12M16.18 [Arabidopsis 
thaliana] 



Table 4: overview of group 3 sequences that show homology with proteins of unknown 
function 



SEQ 

(D 

NO 


Tag name and 


Function 


Fase 








119 


Bstd -11-320 




M-G1 


120 


Bstd -12-255 




G2/M-M-G1 


121 


Bstd -12-275 




G2/M-M-G1 


122 


Bstc1-1 3-143 


unknownprotein 


G2/M-M-G1 


123 


Bstd -13-1 60 


unknownprotein 


G2/M-M-G1 


124 


Bstd 1-3-190 




M-G1 


125 


Bstd 1-3-21 5 


putativeprotein 


G2/M-M-G1 


126 


Bstd 1-3-230 




G1/S; M-G1 


127 


Bstd 1-3-300 


unknown 


M-G1 


128 


Bstd 3-4-1 68 


hv o oth eti ca 1 d rote i n 


S-G2 


129 


Bstd 3-4-290 


hvDothpticalnrotein 


M-G1 


130 


Bstd 4-205 




G2/S-G2 


131 


Bstd -43-1 07 




G2/S-G2 


132 


Bstd 4-3-1 65 


unknown 


M-G1 


133 


Bstd -43-250 


unknown 


G2/M-M-G1 


134 


Bstd-43-310 


h VDoth s ti ca 1 n ro i n 


G2/M-M 


135 


Bstc21 -2-270 


hy poth eti calprotein 


G2/M-M-G1 

ju-/ ivi ivi I 


136 


Bstc2-21-182 


unknown 


M-G1 


137 


Bstc22-1-275 


unknownprotein 


G2-M-G1 


138 


Bstc2-22-100 


unknown 


G2-G2/M 


139 


Bstc2-22-155 




G2-M 


140 


Bstc2-22-240 


hypotheticalprotein 


M 


141 


Bstc22-2-270 




G1/S; M-G1 


142 


Bstc2-23-135 




G2/S-G2-M 


143 


Bstc2-23-220 


unknown 


G2-M-G1 


144 


Bstc22-4-215 


hypotheticalprotein 


G2/M-M 


145 


Bstc2-31-280 




G2/M-M-G1 


146 


Bstc23-2-240 


unknown 


M 


147 


Bstc23-2-330 


putativeprotein 


M 


148 


Bstc23-2-370 




G1/S-S; 
G2/M-M-G1 


149 


Bstc2-32-400 




G1/S-S; 
G2/M-M-G1 


150 


Bstc23-3-270 




G1/S-S; M-G1 


151 


Bstc2-33-280 


unknownprotein 


G1/S-S;M-G1 


152 


Bstc2-34-120 


unknown 


G2/M-M-G1 


153 


Bstc23-4-300 


unknown 


M 


154 


Bstc2-41-165 




G1/S-S 


155 


Bstc2-42-100 


unknown 


G1/S-S 


156 


Bstc2-43-210 




M-G1 


157 


Bstc31-185 


unknown 


G2/M-M-G1 


158 


Bstc3-1 2-145 


unknown 


S-G2 


159 


Bstc3-1 2-290 


unknown 


G2/M-M-G1 



SEQ 

ID 

NO 


Tag name and 


Function 


Fase 








160 


Bstc31 -3-400 


unknown 


G2/M-M-G1 


1o1 


bstc32-1-122 


unknown 


M-G1 


H CO 


D^+^O oh «IOC 

Bstc3-2 1-125 




G1/S-S; 


I DO 


dSICo^-Z-1 ou 


. . 

putativeprotein 


Oid/iVr-IVi-oi 


A Pi A 








I DO 






yj l/O-o-o^/o, 

M-G1 


I DO 


D5lCo-o l-OOU 


puiauveproiein 




1R7 


Rctr^^ O AAR 


n y p o i n en ca j p roxe i n 


o i/o-o, 


1fift 
I uo 


DolUiJ'OO Oj\J 




o l/o-o 


169 


Bstc33-360 


putativeprotein 


G2/M-M-G1 


\ f u 


DSlCoO-**-^ / V 


unknown 




i 71 


D c f r Q A A 07C\ 
DSIC»3-*f I -c. ( U 


unknown 


IVI-ol 


1 79 


DSlUO-^t l-oUU 




o^:/lVI-lVI-ijjl 


17^ 


D5lW-*t 1 -ODU 






174 


Bstc3-42-175 




M-G1 


175 


Bstc3-43-135 




G1 


176 


Bstc3-43-180 




M-G1 


177 


Bstc3-43-193 


unknown 


G1/S-S; 
G2/M-M-G1 


178 


Bstc3-43-287 




G1/S-S 


179 


Bstc3-44-145 




M-G1 


180 


Bstc3-44-375 


putativeprotein 


M-G1 


181 


Bstc4-1 1-120 


hypotheticalprotein 


G2/M^M-G1 


182 


Bstc4-1 1-320 


unknown 


M-G1 


183 


Bstc42-3-115 


unknown 


M-G1 


184 


Bstc42-3-125 


putativeprotein 


G2/M-M-G1 


185 


Bstc4-23-210 




M-G1 


186 


Bstc42-4-225 


unknown 


G1/S-S-G2 


187 


Bstc4-32-115 


unknownprotein 


G1/S-S; 
G2/M-M-G1 


188 


Bstc4-32-185 


unknown 


G1/S-S 


189 


Bstc4-32-190 


unknown 


G27M-M 


190 


Bstc4-32-270 


unknown 


G2/S-G2-M 


191 


Bstc4-32-410 




G1/S-S-G2- 
G2/M 


192 


Bstc4-34-250 




G2/M-M-G1 


193 


Bstc4-4 1-230 


putativeprotein 


G2/M-M-G1 


194 


Bstc4-43-113 


unknown 


M-G1 


195 


Bstc44-3-125 




G2/M-M 



31 
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196 


Bsttl -12-340 


unknown 


G2/M-M 


197 


Bsttl 2-2-225 




G1/S-S-G2 


198 


Bstt1-22-330 


unknown 


G2/M-M-G1 


199 


Bstt12-2-420 


unknown protein 


G2/M-M-G1 


200 


Bsttl 2-2-540 


hypotheticalprotein 


G2/M-M-G1 


201 


Bstt1-23-155 




M-G1 


202 


Bsttl 2-3-2 15 


hypotheticalprotein 


G2/M-M-G1 


203 


Bstt12-3-280 


unknown 


G1/S-S-G2 


204 


Bsttl 2-3-310 


hypotheticalprotein 


G1/S-S 


205 


Bsttl 2-3-350 




G1/S-S-G2- 
G2/M 


206 


Bstt1-24-205 




G2/M-M-G1 


207 


Bsttl -24-220 




G1/S-S-G2 


208 


Bsttl -31 -170 


hypotheticalprotein 


G2/M-M-G1 


209 


Bsttl -31 -21 5 


unknown 


G2/M-M-G1 


210 


Bsttl 3-210 


unknown 


G2/M-M-G1 


211 


Bstt14-4-310 


unknownprotein 


G2/M-M-G1 


212 


Bstt2-11-165 


unknown 


G2/M-M-G1 


213 


Bstt2-12-190 




G1/S-S-G2 


214 


Bstt21-4-150 


hypotheticalprotein 


G1/S-S-G2/S 


215 


Bstt21-4-250 




G1/S-S; 
G2/M-G1 


216 


Bstt21 -4-470 




G2/M-M-G1 


217 


Bstt22-1-170 


unknown 


S-G2 


218 


Bstt2-21-190 


unknown 


G2/M-M 


219 


Bstt22-2-190 


unknown 


G2/M-M-G1 


220 


Bstt22-2-290 


hypotheticalprotein 


G2/M-M-G1 


221 


Bstt22-3-225 




M 


222 


Bstt22-3-275 


unknown 


G2/M-M 


223 


Bstt22-3-315 


TomatoEST 


G2/M-M-G1 


224 


Bstt22-3-370 


unknown 


G2/M-M-G1 


225 


Bstt22-3-390 


putativeprotein 


G2/M-M-G1 


226 


Bstt22-3-480 




G2/M-M-G1 


227 


Bstt23-1-140 




S-G2-G2/M 


228 


Bstt23-120 


unknownprotein 


G2/M-M-G1 


229 


Bstt23-1-200 




S-G2-M 


230 


Bstt2-31-300 


unknown 


S 


231 


Bstt2-32-220 




M 


232 


Bstt2-32-400 


hypotheticalprotein 


G2/M-M-G1 


233 


Bstt23-3-350 


unknown 


G2-M 


234 


Bstt23-370 


unknown 


G2/M-M-G1 


235 


Bstt24-1-320 




S-G2 


236 


Bstt24-2-310 




G2/M-M-G1 


237 


Bstt2-43-210 


unknown 


G2-M 


238 


Bstt2-43-240 




S-G2/S 


239 


Bstt31-1-100 


hypotheticalprotein 


G1/S-S-G2 



240 


Bstt3-1 1-205 




G1/S-S-G2 


241 


Bstt31 -1-250 


hypotheticalprotein 


G2/M-M-G1 


242 


Bstt31-1-430 


hypotheticalprotein 


G2/M-M-G1 


243 


Bstt3-1 2-360 


unknownprotein 


G2/M-M 


244 


Bstt31 -3-380 




G1/S-S 


245 


Bstt31 -4-420 


hypotheticalprotein 


G2/M-M-G1 


246 


Bstt32-180 


putativeprotein 


G2-M-G1 


247 


Bstt3-22-160 


PotatoEST/hypothet 
icalprotein 


G1/S-S-G2 


248 


Bstt32-3-175 


unknown 


G2/M-M 


249 


Bstt32-3-325 


unknown protein 


G2/M-M-G1 


250 


Bstt3-24-135 


unknown 


G2/M-M-G1 


251 


Bstt3-24-200 




G2/M-M-G1 


252 


Bstt3-31-215 


unknownprotein 


G2/M-M-G1 


253 


Bstt3-31-330 


unknown 


G1/S-S-G2 


254 


Bstt33-1-350 


unknown 


G2/M-M-G1 


255 


Bstt33-1-510 


putativeprotein 


G2/M-M-G1 


256 


Bstt33-3-220 


unknown 


G2/M-M-G1 


257 


Bstt33-3-245 


unknownprotein 


G2/M-M-G1 


258 


Bstt3-33-550 


hypotheticalprotein 


G1/S-S; M-G1 


259 


Bstt33-4-140 


putativeprotein 


S-G2 


260 


Bstt34-2-165 


unknown 


G1/S-S-G2 


261 


Bstt3-42-325 


hypotheticalprotein 


G2/M-M-G1 


262 


Bstt3-44-150 


unknown 


G2/M-M-G1 


263 


Bstt3-44-250 


unknown 


G2/M-M-G1 


264 


Bstt34-4-310 


unknown 


G2/M-M-G1 


265 


Bstt3-44-345 


hypotheticalprotein 


G2/M-M-G1 


266 


Bstt4 1-2-340 




G2/M-M-G1 


267 


Bstt41 -3-310 


unknown 


G2/M-M" 


268 


Bstt4-21-185 




M-G1 


269 


Bstt42-1-370 




S-G2-G2/M 


270 


Bstt4-23-480 


unknown 


G2/M-M-G1 


271 


Bstt4-24-170 




G2/M-M-G1 


272 


Bstt43-265 


unknown 


G1/S-S-G2/M 


273 


Bstt43-3-350 


unknown 


G2/M-M-G1 


274 


Bstt4-33-390 


hypotheticalprotein 


G1/S-S;G2/M- 
M-G1 


275 


Bstt4-34-280 




G2/M-M-G1 


276 


Bstt43-4-300 


unknownprotein 


G2/M-M-G1 


277 


Bstt43-4-330 


unknownprotein 


G2/M-M-G1 


278 


Bstt43-4-340 




G2/M-M-G1 


279 


Bstt44-4-250 


hypotheticalprotein 


G2/M-M 


280 


Bstt4-44-400 


hypotheticalprotein 


G2/M-M-G1 


281 


MBc03-90 


unknown 


S-G2 


282 


MBc42-180 


unknown 


G2-M-G1 


283 


MBC43-210 


unknown 


G1/S-S-G2 



SEQ 

ID 

NO 


Tag name 


Function 


Fase 


284 I 


9std 1-100 


unknown 


G2/S-G2-M 


285 I 


&stc1 -11-110 


unknown 


S 


286 f 


^std -11-115 


unknown 


G1/S-S;G2/M-M-G1 


287 I 


Bstd -11-120 




G1/S-S-G2 


288 I 


istd 1-1-125 


unknown 


G2/M-M-G1 


289 


istd 1-1-290 


NaD 


G1/S;G2/M-M-G1 


290 


istd -12-155 




G2/S-G2-M 


291 


istd -12-175 


unknown 


S 


292 fi 


istd -12-185 


unknown 


G2/M-M-G1 


293 


istd 1-3-116 


unknown 


S-G2 


294 E 


istd 1-3-118 


unknown 


G2/M-M-G1 


295 ( 


istd -13-120 




S 


296 


istd -13-130 




G1/S-S; G2/M-M-G1 


297 f 


?std -13-132 


unknown 


M-G1 



SEQ 

ID 

NO 


Tag name 


Function 


Fase 


298 I 


3stc1 -13-142 


unknown 


G1/S-S 


299! 


3stc 11-3-187 


unknown 


S-G2/S 


300 f 


3stc1 1-3-200 


unknown 


G1/S-S-G2/S 


301 ( 


3stc1 1-3-290 


unknown 


G2/S-G2-M-G1 


302 I 


istd -14-100 


unknown 


G2/M-M 


303 ( 


istd -14-108 


unknown 


G2/M-M-G1 


304 I 


istd 1-4-130 


unknown 


G1/S-S-G2 


3p5! 


istd 1-4-135 


unknown 


G2/M-M-G1 


306 ! 


istd 1-4-140 


unknown 


S-G2-M 


307 


istd -14-155 




G2/M-M 


308 \ 


istd -14-165 




G2-G2/M 


309 I 


istd -14-167 




G2-G2/M 


310 


istd 1-4-175 




G2/M-M-G1 


311 \ 


istd 1-4-200 


unknown 


G1/S-S 



32 



BNSDOCID: <WO. 



030851 15A2_I_> 



WO 03/085115 



312 


festd 2-1-110 


unknown 


S-G2 


313 


!3std -21-150 


unknown 


G2/M-M-G1 


314 


3std 2-1-160 


unknown 


G2-M-G1 


315 


3std 2-1-240 


unknown 


M-G1 


316 


3std 2-1-95 


unknown 


G1/S-S-G2 


317 


3std -22-110 




G2-M-G1 


318 


3std 2-3-103 


unknown 


G2/M-M-G1 


319 


3std 2-3-125 


unknown 


G1/S-S; G1 


320 


3std 2-3-235 




M-G1 


321 


3std 2-3-237 


unknown 


G1/S-S 


322 


3std 2-4-130 


unknown 


G2/M-M-G1 


323 


Bstd 2-4-133 


unknown 


S-G2 


324 


3stc1 2-4-145 


unknown 


M-G1 


325 


3std 2-4-235 


unknown 


G2/M-M-G1 


326 


3std 3-1-150 




M-G1 


327 


Bstd 3-2-170 


unknown 


G2/M-M-G1 


328 


3std 3-2-180 


unknown 


G1/S-S 


329 


3std 3-2-190 


unknown 


G1/S-S 


330 I 


3std 3-2-280 


unknown 


G1/S-S; G2/M-M-G1 


331 I 


3std -41-170 


unknown 


G1/S-S 


332 I 


3std -41-175 


unknown 


G1/S-S 


333 I 


Bstd -41-180 


unknown 


G1/S-S; G2/M-M-G1 


334 I 


3std -41-210 


unknown 


G1/S-S 


335 f 


3std -41-230 




G1/S; G2/M-M-G1 


336 ( 


Bstd 4-2-140 


unknown 


M-G1 


337 f 


3std -42-150 


unknown 


G2/S-G2 


338 i 


3stc1 -42-80 


unknown 


G1/S-S-G2 


339 i 


3std -42-90 


unknown 


G2-M 


340 8 


3std -43-105 




G2/M-M 


341 


Bstd 4-3-105 




G1/S-S; G2/M-M 


342 


3std -43-110 




G1/S-S; G2-M 


343 I 


3std 4-3-130 


unknown 


G2/M-M-G1 


344 i 


3std -43-1 4a 


unknown 


S-G2 


345 


Bstd -43-150 




G2/M-M-G1 


346 I 


Bstd -43-175 




S-G2 


347 I 


Bstd -43-185 


unknown 


G1/S-S-G2/S 


348 ! 


Sstd 4-3-235 


unknown 


G1/S-S 


349 i 


3stc1 4-3-260 


unknown 


G2/M-M-G1 


350 [ 


3std -43-65 


unknown 


G1/S-S-G2 


351 t 


3stc1 -43-75 


unknown 


S-G2 


352 I 


3stc1 -44-138 


unknown 


G1/S-S-G2/S 


353 I 


Bstd -44-140 


unknown 


G2/S-G2-M 


354 I 


3std -44-157 


unknown 


G2/S-G2 


355 I 


3std 4-95 


unknown 


G2/M-M 


356! 


3stc2 1-1-100 


unknown 


G2/M-M-G1 


357 i 


3stc2 1-1-140 


unknown 


G1/S-S-G2 


358 I 


3stc2 1-1-145 


unknown 


M-G1 


359! 


3stc2 1-1-65 


unknown 


G2-M-G1 


360 I 


3stc2 1-2-120 




G2/M-M 


361 I 


3stc2 1-2-215 




G2/M-M 


362 I 


3stc2 1-2-75 




S-G2-M 


363 ! 


3stc2 -13-110 




G1/S-S;G2/M-M 


364 I 


3stc2 -14-100 


unknown 


G2/M-M-G1 


365 I 


3stc2 1-4-120 


unknown 


M-G1 


366! 


3stc2 -14-125 


unknown 


G2/M-M-G1 


367 i 


3stc2 1-4-130 


unknown 


G2/M-M-G1 


368 i 


3stc2 -14-135 


unknown 


S-G2/S 


369 I 


3stc2 1-4-135 




S-G2 


370 i 


3stc2 1-4-155 


unknown 


G2/M-M-G1 


371 f 


3stc2 -14-160 




M-G1 


372 | 


3stc2 1-4-180 


unknown 


G2/S-G2 


373 I 


3stc2 2-100 


unknown 


G2-M 


374 i 


3stc2 -21-120 


unknown 


G1/S-S 


375 \ 


3stc2 2-1-125 


unknown 


S-G2 


376 ( 


3stc2 -21-170 


unknown 


M-G1 
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377 


festc22-1-98 


unknown 


S-G2-G2/M 


378 


3stc2 2-2-110 


unknown 


G2/M-M-G1 


379 


3stc2 -22-160 


unknown 


G1/S-S; G2-G2/M 


380 


3stc2 2-2-165 


unknown 


G1/S-S 


381 


3stc2 -22-90 




S; G2-M 


382 


3stc2 -23-110 


unknown 


G2/M-M 


383 


3stc2 -23-140 




M-G1 


384 


3stc2 2-3-1 50 




S-G2 


385 


3 stc2-23-175 




M-G1 


386 


3stc2 -23-195 


unknown 


M-G1 


387 


3stc2 2-3-90 




M-G1 


388 


3stc2 -24-100 


unknown 


G2/M-M-G1 


389 


3stc2 2-4-140 




G1/S-S-G2-M 


390 I 


3stc2 -24-165 




G2/M-M 


391 I 


3stc2 -24-170 


unknown 


G1/S-S 


392 I 


3stc2 -31-140 


unknown 


G2/M-M-G1 


393 I 


3stc2 -31-160 




M-G1 


394 I 


3stc2 -31-170 


unknown 


M-G1 


395 ( 


3stc2 3-2-135 


unknown 


G2/M-M-G1 


396 f 


3stc2 -32-285 




G2/M-M 


397 I 


3stc2 3-2-360 


unknown 


G1/S; G2/M-M-G1 


398 f 


3stc2 3-2-80 


unknown 


G2/M-M 


399 f 


3stc2 3-3-175 


unknown 


G1/S-S-G2 


400 I 


3stc2 -33-200 


unknown 


G2/M-M-G1 


401 I 


3s tc23-3-305 


unknown 


M-G1 


402 E 


?stc2 -33-85 




S-G2 


403 I 


3stc2 -33-95 


unknown 


G2/M-M-G1 


404 I 


^Stc2 3-4-110 


unknown 


G2-M 


405 i 


3stc2 3-4-120 


unknown 


G1/S-S-G2 


406 i 


3stc2 3-4-310 




S-G2 


407 i 


3stc2 3-4-335 




G2-M-G1 


408 I 


3stc2 -41-110 


unknown 


S-G2 


409. 1 


3stc2 4-2-165 




M-G1 


410 I 


3stc2 -43-105 


unknown 


S-G2-G2/M 


411 I 


3stc2 -43-130 


unknown 


G2/M-M 


412 1 


3stc2 4-3-285 




G1 


4131 


3stc2 -43-77 


unknown 


G2/M-M-G1 


414 I 


3stc2 -43-90 


unknown 


G2/M-M-G1 


415 1 


3stc2 4-4-125 


unknown 


G1/S-S 


416 I 


3stc2 -44-175 


unknown 


G2/M-M-G1 


417! 


3stc2 4-4-220 




G2/M-M-G1 


418 I 


3stc2 4-4-230 




G2-G2/M 


419 I 


istc2 -44-95 


unknown 


M-G1 


420 I 


3stc3 1-110 


unknown 


G1/S-S 


421 I 


3stc3 1-1-250 




G2/M-M 


422 I 


3stc3 1-1-77 




M-G1 


423 f 


3stc3 1-1-90 


unknown 


M-G1 


424! 


3stc3 -12-115 


unknown 


M-G1 


425! 


3stc3 1-2-190 


unknown 


G1/S-S-G2 


426! 


3stc3 1-3-127 


unknown 


G1/S-S-G2/M 


427 I 


3stc3 1-3-235 


unknown 


S-G2 


428 ! 


3stc3 -13-330 




G1 


429 I 


3stc3 1-3-60 


unknown 


G2-M 


430 I 


3stc3 1-3-80 


unknown 


S-G2-M-G1 


431 i 


3stc3 -13-90 


unknown 


G2/M-M-G1 


432! 


3stc3 -13-95 


unknown 


M-G1 


433! 


3stc3 -14-105 


unknown 


M-G1 


434 ! 


3stc3 -14-110 


unknown 


M-G1 


435 I 


3stc3 -14-125 


unknown 


G2/M-M-G1 


436 I 


3stc3 -14-130 


unknown 


G1/S; M-G1 


437! 


3stc3 2-1-108 


unknown 


G1/S-S-G2 


438 ! 


3stc3 2-1-170 


unknown 


S-G2/S 


439 i 


3stc3 -21-70 


unknown 


M-G1 


440 ! 


3stc3 2-2-100 


unknown 


G1/S-S-G2 


441 I 


3stc3 2-2-270 


unknown 


G1/S; G2/M-M-G1 
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442 


I3stc3 2-2-390 


unknown 


G2/M-M-G1 


443 


I3stc3 2-2-93 


unknown 


G2/M-M 


444 


(3stc3 2-3-100 


unknown 


S-G2 


445 


I3stc3 -23-125 


unknown 


G2/M-M-G1 


446 


Bstc3 2-3-155 




S-G2-M 


447 


[istc3 -23-175- 


unknown 


G2/M-M-G1 


448 


3stc3 -23-177 




G2/S-G2-M-G1 


449 


3stc3 2-3-63 


unknown 


S-G2 


450 


3stc3 -23-65 




S; G2-M-G1 


451 


5stc3 -24-155 


unknown 


G2/M-M-G1 


452 


3stc3 2-4-230 


unknown 


G2/M-M 


453 


5stc3 2-4-250 


unknown 


G2/M-M-G1 


454 


3stc3 -24-255 


unknown 


G2/M-M-G1 


455 


3stc3 -24-305 




G2-M-G1 


456 


Sstc3 -24-340 


unknown 


G1/S-S; M-G1 


457 I 


3stc3 -24-90 




M-G1 


458 


3stc3 -31-130 


unknown 


G1/S-S-G2 


459 


3stc3 3-120 


unknown 


G1/S-S 


460 


3stc3 -31-200 




S-G2 


461 I 


3stc3 -31-260 


unknown 


G1/S-S 


462 I 


3stc3 3-150 


unknown 


G2/M-M-G1 


463 ( 


3stc3 -32-105 


unknown 


G2-G2/M 


464 I 


3stc3 -32-120 




G1/S-S; G2/M-M-G1 


465! 


3stc3 -32-240 


unknown 


S-G2 


466 \ 


3stc3 -32-320 




G1/S-S-G2; M-G1 


467 [ 


3stc3 3-280 


unknown 


G2-M-G1 


468 


3stc3 3-2-90 


unknown 


S-G2 


469 I 


3stc3 3-3-105 


unknown 


G2/M-M-G1 


470 I 


3stc3 3-3-115 




G1/S-S; M-G1 


471 I 


3stc3 3-3-165 




G1/S-S-G2/S 


472 I 


?stc3 -34-110 




G2/M-M 


473 I 


3stc3 3-4-165 




G2/M-M 


474 I 


5stc3 3-4-200 




S 


475 I 


3stc3 -34-290 


unknown 


G2/M-M-G1 


476 i 


3stc3 -34-85 


unknown 


G2-M-G1 


477 I 


3stc3 -34-90 


unknown 


G1/S-S 


478 I 


3stc3 3-90 


unknown 


S 


479! 


3stc3 4-115 




G2-M-G1 _j 


480 I 


3stc3 -41-180 




G2/M-M-G1 


481 I 


3stc3 4-13-300 


unknown 


G/S-S;M-G1 


482 I 


3stc3 4-3-100 




M-G1 


483! 


3stc3 4-3-135 




S-G2-G2/M 


484 I 


3stc3 4-3-190 




S-G2-M-G1 


485 I 


3stc3 -43-210 


unknown 


G1/S-S; M-G1 


486 I 


3stc3 4-3-210 


unknown 


G2/S-G2-G2-G2/M 


487 I 


3stc3 -43-240 




G1/S-S; G2/M-M-G1 


488 I 


3stc3 4-3-248 


unknown 


S 


489! 


3stc3 4-3-263 


unknown 


G2/M-M-G1 


490 f 


3stc3 -43-280 


unknown 


G2/M-M-G1 


491 I 


Sstc3 4-3-95 


unknown 


S 


492! 


3stc3 -44-155 


unknown 


G1/S-S; M-G1 


493 I 


3stc3 -44-173 




G2/M-M-G1 


494 ( 


3stc3 4-80 


unknown 


S-G2/S 


495 t 


3stc4 -11-117 




G2/M-M-G1 


496 ( 


Sstc4 1-1-125 


unknown 


M-G1 


497 I 


3stc4 1-1-130 


unknown 


G2-M-G1 


498 I 


3stc4 -11-180 




G2/M-M-G1 


499 I 


3stc4 1-1-195 


unknown 


G1/S-S-G2 


500 ( 


3stc4 1-1-197 


unknown 


G2/M-M-G1 


501 f 


3stc4 -11-210 ' 


unknown 


G1/S-S-G2/S 


502 I 


3stc4 1-1-210 


unknown 


G1/S-S-G1/S 


503 I 


3stc4 1-1-245 


unknown 


M-G1 


504 I 


3stc4 -11-350 


jnknown 


G2/M-M 


505 I 


3stc4 1-1-90 


unknown 


G2/M-M-G1 


506 I 


3stc4 -12-150 


jnknown 


G2-M-G1 
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507 


festc4 1-2-280 




S-G2-M J 


508 


1 3stc4 -13-112 


unknown 


S-G2 J 


509 


!3stc4 1-3-170 


unknown 


G1/S-S 


510 


I3stc4 1-3-205 


unknown 


G2/M-M-G1 


511 


IJstc4 -13-280 


unknown 


G1/S-S-G2/S 


512 


Bstc4 -13-70 


unknown 


G2/M-M-G1 


513 


3stc4 1-4-105 




M-G1 


514 


3stc4 1-4-112 


unknown 


G27M-M 


515 


3stc4 -14-120 


unknown 


G1/S-S; M-G1 


516 


3stc4 1-4-127 


unknown 


S-G2-M 


517 


Sstc4 1-4-145 


unknown 


G2/M-M-G1 


518 


Sstc4 -14-160 


unknown 


G2/M-M-G1 


519 


Bstc4 1-4-165 


unknown 


G2-M-G1 


520 


3stc4 1-4-185 




G1/S-S-G2 


521 


3stc4 1-4-270 




G1/S-S; G2/M-M-G1 


522 


3stc4 2-1-150 


unknown 


G2/M-M-G1 


523 


3stc4 -21-155 




G1/S-S-G2 


524 


3stc4 -21-200 


unknown 


S; G2/M-M-G1 


525 


3stc4 2-135 


unknown 


G2/M-M-G1 


526 


3stc4 -22-150 


unknown 


G1/S-S; G1 


527 


3stc 42-2-170 




S-G2-M 


528 


3stc4 2-2-185 




M-G1 


529! 


3stc4 2-2-220 


unknown 


M-G1 


530 I 


3stc4 2-3-100 


unknown 


M-G1 


531 I 


3stc4 -23-115 


unknown 


M-G1 


532 I 


3stc4 2-3-133 




S-G2/S 


533 f 


3stc4 -23-135 


unknown 


G2/M-M-G1 


534 I 


3stc4 2-4-110 


unknown 


G1/S-S; G2/M-M-G1 


535 i 


3stc4 -24-240 




G1/S-S-G2 


536 \ 


3stc4 -31-260 




G2/M-M-G1 


537 t 


3stc4 -31-310 


unknown 


S; G2/M-M-G1 


538 E 


3stc4 3-3-100 




S-G2-M 


539 6 


3stc4 3-3-103 


unknown 


G2/M-M-G1 


540 


3stc4 3-3-135 




M-G1 


541 I 


3stc4 3-3-175 




G2/M-M-G1 


542 I 


3stc4 3-3-250 


unknown 


M-G1 


543 I 


3stc4 -34-135 


unknown 


G2/M-M-G1 


544 i 


3stc4 -34-185 




G1/S-S 


545! 


3stc4 3-4-200 


unknown 


G2/M-M-G1 


546 I 


3stc4 3-4-320 




G1/S-S 


547 ( 


3stc4 -41-100 


unknown 


G2-M 


548 I 


3stc4 -41-105 


unknown 


G1/S-S; G2/M-M-G1 


549 I 


3stc4 -41-107 


unknown 


G2/M-M-G1 


550 I 


3stc4 -41-125 


unknown 


M-G1 


551 I 


3stc 4-41-180 




G2/M-M-G1 


552 I 


3stc4 -41-220 


unknown 


M-G1 


553 f 


3stc4 4-150 


unknown 


G2-M-G1 


554 I 


3stc4 -42-110 


unknown 


G2/M-M-G1 


555 I 


3stc4 -42-115 


unknown 


G2/M-M 


556 B 


3stc4 -42-130 


unknown 


S-G2 


557! 


3stc4 -42-165 


unknown 


G1/S-S; M-G1 


558 I 


3stc4 -42-217 


unknown 


G2/M-M-G1 


559 f 


3stc4 -43-103 


unknown 


G1/S-S-G2-G2/M 


560 i 


3stc4 4-3-167 


unknown 


G2/M-M-G1 


561 I 


3stc4 4-3-170 




M-G1 


562 I 


3stc4 4-4-120 


unknown 


M-G1 


563 I 


3stc4 4-4-290 


unknown 


G2/M-M-G1 


564 I 


3stt1 -11-190 




G1/S-S 


565 I 


3stt1 -11-200 


unknown 


G1/S-S-G2-G2/M 


566! 


3s«1 -11-55 


unknown 


G1/S-S 


567 I 


3stt1 -11-65 


unknown 


G1/S-S-G2 


568 ( 


3stt1 -12-105 


unknown 


G2/M-M 


569 I 


isttl -12-115 




G1/S-S 


570! 


3s«1 -12-230 




S-G2 


571 I 


3stt1 -13-150 


unknown 


G2/M-M 
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Old. 


tisttl -13-230 


unknown 


G2/S-G2-M 


O / O 


J jSu I -14-125 


unknown 


G1/S-S 




Bsttl -14-220 


unknown 


G2/M-M 


ETC 


Bsttl -21-100 


unknown 


G2/M-M 


575 


Bsttl 2 -1-240 


unknown 


S-G2-M 


oil 


Bsttl -21-250 


unknown 


S; G2/M-M-G1 


r-70 

578 


Bsttl 2 -2-100 


unknown 


G2/S-G2-M-G1 


579 


Bsttl 2 -2-140 


unknown 


G2/M-M-G1 


580 


Bsttl -22-160 




G2/M-G1 


CO 'I 

5o1 


Bsttl 2 -2-215 


unknown 


G2/M-M 


582 


Bsttl -22-225 




M-G1-G1/S 


coo 
ooo 


Bsttl 2 -2-360 


unknown 


G2/M-M-G1 




Bsttl -22-70 


unknown 


G1/S-S 


coc 

585 


Bsttl 2 -3-115 


unknown 


G1/S-S-G2 


coc 

58b 


Bsttl -23-150 


unknown 


G2-M-G1 


587 


I3stt1 -23-170 


unknown 


G2-M 


588 


I3stt12 -3-170 


unknown 


G1/S-S 


589 


I3stt1 -23-180 


unknown 


G2/S-G2-M 


590 


Bsttl -23-185 




G2-M-G1 


591 


I3stt1 -23-235 


unknown 


G2-M 


592 


Bsttl -24-105 


unknown 


G2/S-G2-M-G1 


593 


Bsttl -24-120 


unknown 


G2/M-M-G1 


594 


3stt1 2 -4-260 




G2/S-G2-G2/M 


595 


3stt12 -4-320 




G2/M-M 


596 


5Stt1 -31-120 




G2/M-M-G1 


597 


3stt1 -31-180 


unknown 


G2/M-M-G1 


598 


3stt13 -170 


unknown 


G1/S-S-G2 


599 [ 


3stt1 3.-2-1 50 




G1/S-S-G2 


600 I 


3stt1 -32-170 


unknown 


G1 /S-S-G2 


601 I 


3stt1 -32-185 




G1/S-S 


602 E 


3stt13 -3-100 


unknown 


G1/S-S-G2-M 


603 I 


Bsttl -33-170 


unknown 


G1/S-S-G2 


604 ( 


3stt1 3 -3-320 


unknown 


G2/M-M-G1 


605 f 


Bsttl -33-66 




G2/M-M 


606 [ 


3stt1 -41-120 


unknown 


G2/M-M 


607 I 


3stt1 -42-264 


unknown 


G2-M-G1 


608 [ 


3stt14 -2-280 


unknown 


G2/M-M-G1 


509 I 


3stt14 -3-120 




S-G2 


o10 I 


^ i 1 J a r\ A\ A f\ 

3stt14 -3-140 


unknown 


G1-S-S-G2 


31 1 I 


3stt1 -43-220 


unknown 


G2/S-G2-G2/M 




312 i 


Bsttl -43-330 


unknown 


G2/M-M-G1 




313 f 


3stt14 -3-460 


unknown 


G2/M-M 




314 I 


*stt14-4-130 i 


unknown 


S-G2 




315 I 


istt14 -4-150 i 


unknown 


G2 




316 I 


Sstt14 -4-195 




S-G2-M 




d17 I 


Jstt14 -4-220 




32/S-G2-G2/M 


t 


:>1o I 


istt14 -85 r 


lohits 


32/M-M 


I 

1 
1 

t 


^19 1: 


1 iiA A A A —» r\ 

*stt21 -1-170 l 


jnknown 


32/M-M 


1: 


istt2 -11-290 


( 


32/S-G2-G2/M 


£ 


521 I: 


$stt2 -11-540 


( 


31/S-S 


L 
t 




stt21 -2-190 


( 


32/M-M-G1 




523 I; 


istt2 -13-165 


I 


3-G2-M 


c 


524 B 


stt2 -13-170 i 


jnknown ( 


32/M-M 


e 


>25 13 


stt2 -14-130 i 


jnknown C 


32/M-M 




326 B 


stt2 -14-175 l 


jnknown J 


5-G2 


€ 


)27 13 


stt22 -1-140 l 


jnknown $ 


5-G2 




528 13 


stt2 -21-300 i 


jnknown ( 


B2/M-M 


c 
C 

e 
e 


529 (3 


stt22 -2-110 l 


jnknown C 


31/S-G2 


lor\ i-i 

>o0 13 


stt22 -2-255 


C 
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CLAIMS 

1. A method for identifying and validating plant genes/proteins as targets for agrochemicals, 
said method comprising the steps of: 

a. Determining gene or protein expression profiles during a biological process of a plant or 
5 plant cell, said biological process being necessary for the growth and/or development 

and/or viability of the plant or plant cell; 

b. Selecting genes or proteins having altered expression during said biological process, 

c. Cloning said selected gene or the nucleic acid encoding said protein in its full-length or 
partial form, 

1 0 d. Incorporating said nucleic acid in a vector designed for downregulation of expression of 
said nucleic acid or the sequence homologous to said nucleic acid in a plant or plant 
cell. 



15 



2. The method according to claim 1 , wherein said biological process cell division. 

3. The method according to claim 1 or 2, wherein said gene or protein expression profiling is 

based on nucleic acid or protein samples collected from a synchronized culture of dividing 
plant cells. 

20 4. The method according to claim 3,wherein said dividing plant cells are tobacco BY2 cells. 

5. The method according to any of claims 1 to 4, wherein the expression profiles are 
determined by means of micro-array, macro array or c-DNA-AFLP. 

25 6. The method according to any of claims 1 to 5, wherein said downregulation involves a 
viral-induced gene silencing mechanism. 

7. The method according to any of claim 1 to 6, wherein said downregulation involves the use 
of infectious DNA of virus is Tobacco Rattle Virus and wherein said plant is tobacco. 
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8. A method for screening candidate agrochemical compounds comprising the use of any of 
the methods according to claim 1 to 10. 



9. A method for screening candidate agrochemical compounds comprising the use of any 
35 one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 

thereof or one or more of the proteins corresponding to SEQ ID NO 1 to 785 or a 
homologue, functional fragment or derivative thereof . 

37 



BNSDOCID: <WO, 



03085 115A2J_> 



WO 03/085115 



PCT/EP03/03703 



10. A method for the production of an agrochemical resistant plant, comprising the use of any 
one or more of SEQ ID NO 1 to 785 or a homologue, functional fragment or derivative 
thereof or one or more of the proteins encoded by SEQ ID NO 1 to 785 or a homologue , 

5 functional fragment or derivative thereof. 

11. An isolated nucleic acid identifiable by any of the methods according to claims 1 to 10. 

12. An isolated nucleic acid, comprising at least part of a nucleic acid sequence chosen from 
10 the group of SEQ ID NO 1 to 785 a homologue, functional fragment or derivative thereof. 

13. Use of a gene nucleic acid according to claim 11 or 12 or the protein encoded by said 
isolated nucleic acid as a target for an agrochemical compound. 

15 14. Use of a nucleic acid or protein according to claim 13, wherein the agrochemical 
compound is a herbicide. 

15. A plant tolerant to an agrochemical, in which the expression level of one or more of the 
nucleic acids corresponding the SEQ ID NO 1 to 785 or the homologue, functional 

20 fragment or derivative thereof, is modulated. 

16. A harvestable part of a plant according to claim 15. 
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Sequence Listing 

GROUP 1 
SEQIDNOl 

GAATTCACTAGTGATTGATGAGTCCTGAGTAAGGTGAGACGAGAAGCGACCTTCTGACCACA 
AGACTTGTCAGCCTGAGACAGGTATGATATCCATATACTGCGTATCTCATAAGTGACTCGTG 
GATCGGATAAATGCTCAACCCATTTGCTAACATATCTGTCTTGCCTGTCAGGTTCCCAGGAT 
CACTACGCAGTCAATCGAATTCCCGCGGCCTATAGTGAGTCGTATTAA 

SEQIDN02 

TGACTGCGTAGTGATCTCAAAGAAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCT 
TCTTCAGGGTGCTGCAAACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTT 
CACACAGTGTTTTTACATGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCAC 
CGGTTTGCTCGTCTTACTCAGGACTCATCA 

SEQIDN03 

GNNATGCCCGATNTAAGCCGCCCCTANATACANTTNAAATGGTCCCGGANACCCTGGGNGAC 
AATNATNGACTGNGCAGTGGTTGAAGNTTGACAATTCCTATT 

SEQIDN04 

CNCNATTNTNAATAAGCCCGAAAAAGAAGAAGTAANGGAGGGAGAAGGCCTGAAGAGAATTT 
GCNGGATTTTCTAGAGTTGATACAGAAGTTCTTCCAAACTGTTCCACCTTGGATTATGACAA 
GTAGTGTCACCAACCAAGGTCGAGATGAGATACTGTTGCACATGTCTCAGCTGCGAAACTAT 
TGGCTCAAGCATTGAGTTGGCATCATATGA 

SEQIDN05 

TTTTANGNCANCAAATCTCNCTCTAACGGACCCTNGCATGGCTTGTTCAAAATAAATGCCTC 
AGGACAATACCACGTTATGTAATGGGGAGTGAACTGCGTATATCCGTTCTGCTCNTTTATCT 
GGGCGGNGCCTTTGAAGTTTTTGACAAACTCTNTCTGGNTCTCACACTTAGGGCCACACTCA 
TCATTACTGTTTGTCC7yy\ACTCGTACTCAACTCTTTCATCGGGATGTGGAAGCGCCTCTCT 
CCAATCAAGGTTTATG 

SEQIDN06 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
TCACAAAATGAACANCAGCCNTTGAGAATCNCAGCTGTGNTATGTANNTTCGAAGACATTGG 
CTGAGGATGCTGCATGGAAGTTTGTGAAAGAGAAAGCCTATCGATATGGTTACGATAAACCC 
AGCAATGGTTATTGGCGGTTTGTTACAACCAATAC 

SEQIDN07 

CCCAAGATGAACAGTCAGTCAAGTCGATCTCATGCCATATTTACAATTACATTGGAACAAAA 
GAGAATGGCTAATTGCTCGACGAACGATGATGGTGATGACATATTATGTGCCAAGCTTCATT 
TGGTTGACCTTGCTGGTTCAGAGCGAGCAAAGCGAACTGGAGCTGATGAGATGCGTTTACGA 
NAGGGNATTCATATCAACAGGGGATTGCTTGCTCTTGGCAATGTAATAAGTGCCCTTGGTGA 
TGAAAAGAAGCGGAAAGAAGGNGCACACATCCCATACAGAGATAGCANGTTGACACGTNTCT 
TACAGGACTCACTTGGAGGAAACAGCANGACAGTTATGATTGCTTGTGTCAGTCCTGCTGAC 
ACCAATGCAGAGGAGACCC 
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SEQIDN08 

GCGGTTGATATGTGGTCTGTGGGATGTATTTTTGCCGAGATGGTTCGAAGGCAAGCCTTATT 
TCCTGGTGACTCTGAGTTTCAGCAACTGCTTCACATATTCAGGCTGTTAGGAACCCCAACTG 
AGAAGCAGTGGCCTGGAGTCAGTTCACTCCGCGACTGGCATGTTTATCCAAAATGGGAACCT 
CAGAACTTGGCCTCTGCTGTTCCAGCATTGGGTCCTGATGGCGTGGACCTCCTCACGAAAAT 
GCTCCAATATGATCCGGCAGATAGGATTTCAGCAAAAGCTGCACTTGATCATCCATACTTCG 
ATAGCTTGGACAAGTCTCAGTTTTGAGGTTGCTTCTACTTCTAAGATCAGCC 

SEQIDN09 

GCAGCNAGCNAAGNTNNGGTNGGGNACGCCAANNANNGNTGTGCCTTTGATGACGTCACCAG 
NTATCANTCTACATANAACGGAGGNCTTGCGANNGGCTTGGNTCATTCTACNGNTCTAGGAT 
TNTCAACTCTNNTCAATTCTTCNAT7LACTNACCTATTCTCCTGCAGCAATATGTGAGACGTA 
ACCTAGAATATTATTTGCCTTTATAGATATTGACTTATTCTGCTTGCATATTTTATCTGCAG 
CCGGTGGCATCTACCATCAATACATTGGCTGGTGCATTGTATAAGGTGTTTTGTGCTTCACC 
TGATCAAGCTAGGAAGGAGATGCGCGATGCATGCTTTGACTATTTGAGCCTTGGTGGAGTAT 
TCTCCACAGGACCTGTATCTTTGCTTTCTGGC 

SEQIDNO10 

GTCAGTGCTTGAGCTGAACCCGTTGCTTGGACTTGACAACTAGCATCTTCTCTTTGCATGCT * 

GCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTATGTATTACAAACTATTA 

TTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGAAATAATCTCT 

SEQIDNOll 

CGTTGNTTGTTTCGGGAAATTGGAACAGCATTGGTGAAGGCACTTACGGNCAAGTGTACATG 
GNTAAAGAAATTAGAACAGGGGAAATTGTTGCNTTGAAGAAGATACGCATGGACAACGAANG 
AGAAGGGTTTCCAATANCTGCTATACGTGAAATCAAAATCTTGAAGAAGCTGCACCATGAAA 
ATGNGA 

SEQIDN012 

TAAAGGACCGNTTTTGTTTCGAGAAATTGGNNCAGATTGGTGAAGGCACTTACGGTCAAGTG 
TACATGGCTAAAGAAATTAGAACAGGGGAAATTGTTGCTTTGAAGAAGATACGCATGGACAA 
CGAAAGAGAAGGGTTTCCAATAACTGCTATACGTGAAATCAAAATCTTGAAGAAGCTGCACC 
AT G AAAAT GT G A 

SEQIDN013 

GGACGTTTGCATTTCGGATTNGNGCACGAGATGTTNATGATTTTAGGATTTATTTTAGTCAT 
CTTACTCGGNTGATGTTTATTCGTTTTTGTGACTTTTACTCGNGGGCGGNGGTGACCGCGTA 
CATGCTATTTATTTGATTTTTACTATGGNTATTGNTTATTGTTA 

SEQIDN014 

TTACNTTTACTGAGATNNTTATGATTTTAGGATTTATTTTAGCCATCTTACTCNGGTGATGT 
TTATTCGATTNTGTGACTTTTACTCGNGGGCGGTGGNGACCGCGNACATGCTATTTATTTGA 
TTTTTACTATGGTTATTGTTTATTGTTA 

SEQIDN015 

TATCAAATGGAGAAGTTATCAATATGAAAATAGCTGTCAAGCCACTTCAACTATTGCTAGGA 
AGCAGCAAACTGTGACGCGAGATAAACATGACACAGAACTCATTGCTAGGGGTNGNNATGAT 
NCTTGTGTAGTTCCCNAANCTGTNCCANTGNTTTAAGCAATGGTAGCCCTGGCGCTAGTGGA 
TNAGCTAATGGCTCATTATGCACAGTGTATGCTGTTCCCAA 
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SEQIDN016 

TTGAAGAGTGGAAGTTACGAGTGCTCGAGATGTGATGCAGCAGCTTCTTCAGGGTGCTGCAA 
ACAGAAAGGTAGCTGCAACCAACATGAATCGTGCTAGTAGCCGTTCACACAGTGTTTTTACA 
TGTGTGATAGAGAGCAAATGGGAATCTCAAGGAGTAACTCACCACCGGTTTGCTCGTC 

SEQIDN017 

GGGCCCGCCACCACCGCAACCACCTAGTTATTCCTCCGTCGAACACGTGTCTCACGAGAGTG 
AGAGTGAGAGCGTTCATCGTCAGCATGATCATCATCGTTTTCAACCACATGTGCCTTCATTC 
TTCCACCATGAGACCTCACCACATCCAGAGCTCATCGATAAGCCTTCATTTAGGGTTTATAC 
AAAGGCTGATCCCAATTACTCTCTCACTATCCGTGACGGCAAAGTCGTTCTTGCCTCTTCTG 
ATCCATCCGATCCTTTTCAACACTGGTATAAAGATGAGAAGTACAGCACTAAAGTGAAGGAT 
GAAGAGGGGTTTCCAAGCTTTGCTCTGG 

SEQIDN018 

GTNTNTGTGGCCCACCTGCTGCNAGAGTGACACACAGGNATGATCTTAGAGCNGCNATTCAG 
AAGATGTTAGACACTCCTGNGCCATACTTGNTGGATGTGNTTGTACCTCATCAGGAACATGT 
TNTACCTATGATTCCCAGAGGCGGNGCTTTCAAAGATGTGATCACAGAGGGTGACGGGAGAA 
GNTCCTATTGANTTTGAGNNGCTACAGAGCTAGTTCTAGGCCTTGCATTATCTAAAATAAAC 
TTCTA 

SEQIDN019 

GCAAGGAGTCAAGTGGATATTTTGGATGATGGTTATAGATGGAGGAAATACGGACAGAAGGC 
TGTCAAGAACAACAGATTCCCAAGAAGCTACTACCGATGCACGCATCAAGGATGTAACGTGA 
AGAAACAAGTACAAAGGCTGTCAAAGGATGAAGGAGTAGTAGTAACTACTTATGAAGGCATG 
CAT T C AC AT C C CAT T G AG AAG T C C AC AG AT AAC T T T GAG C AC AT T T T G AC T C AG AT G C AAAT 
CTATGCTTCCTTTTGAAACGTCCATCACTTCAATGCCTAAGGCATGACACTCAATTAGTCAC 
TTGTAAAATAGTACTACAGTATATTGTGTACATGCGTTTTGAACCTAGATGCTATATTTTGA 
AATAAAACGCAACTTCATTAGGGAATTTAATTTGATCATTGTACAACTAAAAGTAATGTTGC 
TATTTTTTTGTTTTTATCACTTTGTTTTTGCCGGAGCCATGCTCTTCATTTTAACTCTTTTC 
T T T T AG AAT T AAC AAAT AAT TTCATGTTG GAG AAAG AT AC G T G C C AAAAAAAAAAAA 

SEQIDNO20 

TAATGGACACGGATCTGCACCAGATAATACGCTCTTCACAAGCACTGACAGAAGATCACTGC 
CAATACTTTCTCTATCAATTATTACGTGGACTCAAGTATGTACATTCAGCTAATGTCCTCCA 
CCGGGATCTGAAACCTAGCAACTTACTACTCAACGCAAACTGTGACCTCAAGATTTGTGATT 
TTGGGCTAGCTAGAACCACTTCAGAGGCGGATTTTATGACTGAGTATGTTGTCACCCGCTGG 

SEQIDN021 

TTTGCAGCCACCTTTNACATTTCGGTAGANGATNTGTCCATAACAAGCCTGACTTTTNTAAA 
GGAATTTACAGCTTCAATTGAAGCAAAACAAGTGGCTGCTCAAGAAGCTGAAAGAGCAAAGT 
TTGTTGTGGAAAAAGCTGAGCAAGATAAGCGAAGTGCTGTTATCAGAGCTCAGGGTGAGGCT 
AAGAGTGCCCAGCTTATTGGTCAAGCGATTGCCAATAATCCGGCATTTATCACACTCAGGAA 
AAT C G AAG C AG C AAG AG AG AT T G C C C AG AC TAT C T C AC AT G C AG C AAAC AAG G T G TACT T G A 
GTGCCGATGATCTGTTGC 
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GROUP 2 
SEQIDN022 

ctgaaccctaacgcacacaacttcactctttgctcctccaaatctctctccaatgcaggatt 

tcatcggctccgttcgccgatctctggttttcaagcagtccggagacttcgataccggcgct 

gccggtgtcggcagcggattcggaggcttcgttgagaaactaggttcgagcattcgcaaatc 

gagtattggaatcttctcgaaagctcatgttcctgctcttccgtctatttctaaagctgagc 

tgcccgcgaaggctcggaaagatgacactccgccaatccggtggaggaaaggtgaaatgatt 

ggatgtggtgcttttggtagggtttatatggggatgaatgttgattctggagagttactcgc 

tataaaggaggtttcgattgcgatgaatggtgcttcgagagagcgagcacaagctcatgtta 

gagagcttgaggaagaagtgaatctattgaagaatctctcccatcccaacatagtgagatat 

ttgggaactgcaagagaggcaggatcattaaatatattgttggaatttgttcctggtggctc 

aatctcgtcacttttgggaaaatttggatccttccctgaatctgttataagaatgtacacca 

agcaattgttattagggttggaatacttgcataagaatgggattatgcacagagatattaag 

ggagcaaacatacttgttgacaataaaggttgcattaaacttgctgatttcggtgcatccaa 

gaaggttgttgaattggctactatgactggtgccaagtcaatgaagggtactccatactgga 

tggctcccgaagtcattctgcagactggccatagcttctctgctgacatatggagtgtcgga 

tgcactattatcgaaatggctacaggaaaacctccttggagccagcagtatcaggaggttgc 

tgctctcttccatatagggacaaccaaatcccatccccccatcccagagcatctttctgctg 

aatcaaaggacttcctattaaaatgtttgcagaaggaaccgcacctgaggcattctgcatca 

aatttgcttcagcatccatttgttacagcagaacatcaggaagctcgcccttttcttcgctc 

atcctttatgggaaaccccgaaaacatggcggcgcaaaggatggatgttaggacctcaatca 

ttcctgatatgagagcttcctgcaatggtttgaaagatgtttgtggtgttagcgctgtgagg 

tgctccactgtatatcccgagaattccttagggaaagagtcactctggaaactaggaaactc 

tgatgatgacatgtgccagatggataatgatgattttatgtttggtgcatctgtgaaatgca 

gttcagatttgcattctcctgctaattataagagttttaatcctatgtgtgaacctgataac 

gattggccatgcaaatttgatgaaagtcccgagttgacgaaaagtcaagcaaacctgcatta 

tgatcaagcaactattaagcccactaataaccccatcatgtcatacaaggaggatcttgctt 

tcacatttccaagtgggcaatctgcagccgaggatgatgatgaattgacagagtctaaaatt 

agggcattccttgatgaaaaggcaatggacttgaagaagctgcaaacaccactatatgaagg 

attctacaattccttgaatgtttccagcacaccgagtcccgttggcactgggaacaaggaaa 

atgttccaagtaacataaacttaccaccaaaaagcaggtcaccaaaacgtatgcttagcaga 

aggctctctactgccattgaaggtgcttgtgctcccagcccagtgactcattccaagcgaat 

atcaaatattggtggcctaaatggtgaagctattcaggaagctcagttgccgaggcataatg 

aatggaaagatcttcttggttctcaacgtgaagcagttaattcaagcttctctgagaggcaa 

agaaggtggaaagaagagcttgatgaagagttgcaaaggaaacgagagattatgcgtcaggc 

agtcaacttatcaccaccaaaggatccaattctaaatcgatgtagaagtaaatcaaggtttg 

catctcctggaagataaatgtatgtacttgtgtccctaaactaaagtcagtttgaagaatat 

aattaatgatcctgcaaccccagaacagagagttagatgtcttgagcaggtatacgaacgtg 

aggttttcttgacccgttactacaggaatatcagcgcttgtcagatagagtgagctgttact 

acaggaatatctgtcaacctgttaatcatattataaaatgccaataatttgcgttgtattcg 

ttttgatcattctcctgagagcattgtaagaaaaatgcaggcctttttataacctatataag 

tgctctctcatggtagttgccaatattaaaacgcagagaaaagtcgagttctcatctgctga 

attgtttgtaaaatgtgatatattaatgtatttaccgtcttacaacc 
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SEQIDN023 

ccacgcgtccgtgatatgggatgtcacattgatggatttattgctgtagttggacatacaca 
tgttcttcacgaaggaccagttactggtagacctgctgacgtcattgcagctgctaatacag 
ccgctgaagttgctttgaggcttgtgagaccaggaaagaagaactcggatgtaacagaagct 
attcagaaagttgctgctgcctatgactgcaagattgttgagggtgtcttgagccatcaaat 
gaagcaatttgttattgatggaaacaaagttgtattgagtgtgaccaatcctgaaacgagag 
tagatgaagcagaattcgaggagaatgaggtttactccattgatattgtgacaagcactggt 
gaaggaaagccaaagttgttggatgagaaacaaacaactatctacaagagagccgtggacaa 
gagctataacctgaagatgaaagcatcaaggtttatcttcagtgaaatcagtcagaagttcc 
ctatcatgccatttaccgcaagggatttggaggagaagagggctcgtctgggcctagttgag 
tgtgttaaccatgagcttttgcagccatatcctgttctacatgagaaacctggtgatttggt 
tgctcacattaaattcacagtgctgttaatgcctaatgggtcggataggattacatctcatg 
ctctccaggagctgaagcctgcaaagtcgatagagaatgaacccgaaatcaaagcctggctt 
gcccttcccgttaagaccaagaagaaaggcggtgggaagaaaaagaaagggaaaaaaggtga 
caagacagaagactcatctcaagctgagccaacggaaggatagagaaatggtttcaaatctt 
gataaatagcaattttgaggtgcttgatcgatcaacttcactgaaactattggttcactgtt 
ggtcggcactttcagctgcctttgttcttccttgtggggctttgctatacaagggacagaca 
gttattgtcctcttgtactgtcatgttaaattactcagttttccaatgctattcaacatgct 
ctcaatcggtctttaaaaaaaaaaaaaaaag 

SEQIDN024 

ccacgcgtccgcaaaaccctagctcaaatcccgtttgcctccattttcattccatcaacaaa 

aacctaagtttatactcagcttgagacatttgataactatgtcggacgacgagagagaagag 

aaagagttggatctgacaagtcctgaggttgttactaagtacaaaaatgctgctgaaattgt 

taacaaggctctgcagttggtggtgtctgaatgcaagccaaaagcaaagatagttgatcttt 

gtgaaaaaggagatgccttcatcaaagagcaaactgggaatatgtacaagaatgtgaagaag 

aaaatcgagaggggtgtggcatttcccacctgcatttcagttaacaataccgtgtgccattt 

ttctccactgtctagtgacgagacagtattggaagaaggtgatatggtgaagattgatatgg 

ggtgtcatatagatggctttattgctgtagttggtcatacacatgtgctccaggaaggacca 

gttactggtagagcagctgacgttgttgcggctgctaatacagctgctgaagttgccctgag 

gcttgtgagaccaggaaggaagaactcggatgtaacagaagctattcagaaagttgctgcgg 

catatgactgcaagattgttgagggtgttttaagccatcaaatgaagcagtttgtgattgac 

ggaaacaaagttgtgttaagtgtgtccaatcctgaaacgagagtagatgatgcagaatttga 

ggagaatgaggtctattcaattgacattgtaacaagcactggtgaaggaaagccaaaattgt 

tggatgagaaacaaacaaccatctacaagagagctgtagataaaagctacaacctgaagatg 

aaagcgtcgaggtttattttcagtgaaatcagtcagaagtttcctgtcatgccatttacagc 

aagggatttggaggagaagagagctcgtttgggactcgttgaatgtgttaaccatgagcttt 

tgcagccctatcctgttctacatgagaaacctggtgatttggttgctcacataaaattcaca 

gtgctgttgatgcctaacgggtcagataggatcacaactcatactctccaggagctgaaacc 

tgctaagacaatagaggatgaacctgaaatcaagacctggttagcccttcccgtaaaaagca 

agaaaaaaggcggcgggaagaaaaagaaagcgaagaaaggtgagaagacagaagactcatcc 

caagctgaaccaatggaaggagaatcaaatggtgctgaatcttgatatgttgctagaacttt 

gatttgattcaattccaagaactatttgttgattgttagttaaatgtgggatattgaggtag' 

ttgtggatctttctttgcggcattttgcaatacaagaatggcatggacagttgttgtccttg 

tcttgacacatttgtcatgctggaattattaagtggggtttccaatgctataatgtcatgtg 

tatcaaaaaaaaaaaaaaaagggcggcaactctagagtatca 
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SEQIDN025 

ccacgcgtccgcttggggattagcaggttgtcgacaaagaaaattcatttgtttcctacgat 
cacacaagtcgtggttgttgcagatccgctttcgctaaggggaaactcaaaagcccagttcg 
tgtagttcatccaaagatgagttccagcaaaagggttgggaagtcttctaattcatcaggaa 
agcagaaagctatatgcgaaacaactactacaccaacggttgatgatataaatgtaggcgta 
gaagatatggggttgaactccgatcaaaatgatggatggatagtgtgttctagaaagtccaa 
gaacaagggtggaagcagcagtgctggaatgaagcaatggatttctcagaatcccactccaa 
aagccaaactgggaatgcgtaacaatattgttggatcatcaggacaggggtctaggaataac 
tggtccacacctaattatcatcctcgaaaacctgctggcagagaatgctacacaccgacacc 
cgctgcagttcctcctgccctgaagaatggttgggattggtcatctgtcgctcgttccaatg 
aggaccatgatacttattcccctgtcgctgatgtcaaggcttcctgtgaacatgatggagag 
gataatgaatcggatttgcctgatgatgacagtgatgatgagcttccgagtgatgacgactt 
tgatgatcactcggatgtaaatgaaatgagtcatgaggtactcaaggaaagtcgttggttca 
agaaccttt t caaatgtcttgacagtttgactgtcact gaga ttaat gat ccggaaagacag 
tggcactgccctgcatgcaaaggtggtccgggtgcaattgagtggtttccagggatacagtc 
agtgatgaaccacgcaaaaacgaaaggatttaggatgaaattacacagacaacttgctcaac 
ttttggaggaagagctgcgtcggaggggaacttctgttgtacctccaggtcaagtgtatgga 
agatggggtggcggtgaatatgaagataaggaaatagtgtggccaccgaccgcgattatcgt 
gaacacagtgcttgagaaagatgaaaatgacaagtggattggaatgggaaatcaggagctgc 
gtgattatttcagctcttatgctgctgtcaaggcagcgcgaagctcatatggtccacaaggc 
catcgtggtatcagtgtgttgatttttgaggccactcccgtgggatacatggaggctgtact 
tctcagtgagcagttttctgaaaaaggaagtgatagagatgcatgggaacacaatccagttc 
tcttttatcctgggggaaaacgtaagctttatggttacatggcagagaaaagagacatggac 
aactttaaccggcattcacatgggaaatcaaggctgaagttcgagatgaggtcatataaaga 
aactgtttcgaatccagcgatgcagatgtcggaggataatcaacagctcatatggttcaaga 
accaagcctctaagcaccaaaagcgggctaaagctactgaagagtctctaagactggtgagt 
gaaaagcaccgtcagacagtcgaagagaacaagattgtcagactgagaactaagatgcacca 
tgaacggaacaaggaagagatggaatatctagagcagttttttaatgatcagttgaaaatga 
tttatgatgccaggactgctgaggaggacaagtttgaaaagatacagcaggaacagcgtgag 
atgatctatcaatctaatgcaactatttcctcggctgaggatcatcgactcagggcagagaa 
agttgcgaaatttatcaaacttcaggacaaggatatggaagaatttgtggaagagagggata 
atctgataagagctcatgaagatagggtaggttcaatgagacgcaaatacttgctgcaatac 
tcggaagaggcagttgcacttgagaagaattttgatctcgaactggctaagctgatggagaa 
gtactcatcaaagcaatctgagcaggtcaacagcagtgatgccgtgtgaccctatagtaata 
ctattcaagcgccgttttagctttaaatttctgtgaacttgggattcttcactgacttttat 
aatcctggtctgtccatgtgttttgatgatgctaaagaaatgattctaatagttatattata 
tcctaaaacatatggcttgaactatttgttctagaaaaaaaaaaaaaaaaaaagggcggcc 
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SEQIDN026 * 

ccacgcgtccggccatggtagcaaaacagttagctgatgacgaaccacaaaaaaccctcaag 
gattcaccaaagtttgaatccaaatcccataagaaaaaacataagagaaagctcgaagaccc 
tgaacctgaagaagttactgttatagagtccaagaaagagaaaaagaagaagaaaaagcaga 
aacagaaccaagaacaagaagggtctattgtaaacagtgaaaatcttagtgggtctaatggc 
aaggttgaaactattaatgggtcagctgagttctctgaaaaaagtagtacaaatgtggtggt 
aactggtaaggatgctaatgagtcaaagtacaaagctttagcaaaatttgtggattcagggc 
ttccaagtgatgtgttagattgttgcaagaattttgagaaaccatcaccaattcaatcacat 
tcatggccttttcttttagatggccgtgatttcattggaattgccaaaactgggtcaggtaa 
gactttggcttttggtattccggctattatgcatgtcctgagcaagagaaagagtaaaaagt 
ctaagaatccgctttgcctcgtgctttcgcctacaagggagctagctcaacaaatatcagat 
gttctctgcgatgctgggaagcctactggtgtgcagtcagtttgtctatatggtggagtcga 
taagcatcatcaaaaagcttctcttaaatctggtgtggatattgttattggaacccctggtc 
gtttgcaggatatgatggaaatgggagcatgcaacttaaaagcggtttcttttgtggtgcta 
gatgaagctgatcggatgctcgatttaggttttgaacctgctgtccgtgccattttgagcca 
aacatgctctgttcgacaatcggttatgttcagtgctacatggcctccggctgttcatcaat 
tagctcaagaattcatggatcctcatccaatcaaggtagttgtaggttcagaagatttggct 
gccaacca t gat gtcatgcaaattgtcgaggtcttggaagatcgagcccgt gat gagcgttt 
acagtgcttgctggaaaaataccacaagtttagaaagaacagagtattggtttttgttttgt 
acaagaaggaagcatctcgggttgaaattatgctacagaaaaggggttggaaagttgtgtcc 
attagcggtgacaagcaacaacatgctcgtactaaggcgttgtcactctttaaggatggaag 
ctgtcctttaatgatagctactgatgtagctgctcgaggtctggatatcccagatgttgaag 
ttgtgataaattatagttttcctttgacaacagaggattatgttcatagaattggaagaact 
gggcgagctggtaaaaaaggtgtagctcatacattcttcactaaggacaacaagggactttc 
tggggagttgataaatgttctcagagaggctggacaggttgtgccagctgcccttcttaatt 
ttggaacccatgtaaagaaaaaggaatcgaagctctatggtgctcattttagagaaatagat 
gcaaatgctccaaaggctacaaaaataaaatttgacaattctgatgaggaagattgagaagc 
aatatcattattaccaaagcaacacaactccattgaattggctcatcatcctgacattccgt 
gcaatcatttggcggatacatgtagaagtggattactgcgggaagaatgcaagagatatctc 
actgctcatgtatatggtaattgaagcttaaatctattggcgcttcaacctgtcatagataa 
tgagtttgaaatactattgtgtttttgtaccttaatattcttttcacccatacagttggctt 
agtaaggtttttctaggatccaaatgtagtaatacacttattataatttgcccttttaagtg 
atgtatgtatgattgcaccttccaaattactgcacttggcaaaaggtggaaaaatattcgaa 
aatgagattcaaaactggttcatgcaaaaaaaaaaaaaaagggcggcc 
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SEQIDN027 

ccacgcgtccggtggaacgccacgttgccattctctcttcggtgacaagcttcaaacgccag 
gcgtctctctcgtatctgagtgctggtgttttttcttctcaggttcaagttccggcagttaa 
tgcggcaaattcaaatttagatgttatgcaatagctgcgtaagatttgtgttttttcaagca 
gcacatattgatacacacattgtgcaaaggcaatttatcaaactcattaaaagtttgaatgc 
aactcgtagcataagttattcttggaatagtgtatatacagcaaggagaacgatgatggtgg 
atactggagcaactgctaaaggaggacctgtcgttgatgtttcaccggagaaggatgataat 
aatggtggtttcgctagcggaggatggaagagtgaagatggaagactgagttgtggttattc 
aagctttagagggaaaagagccaccatggaggatttttatgacattaaaacttggcaaagtt 
gatggacaaacaggtagcttatttgggatatttgatggccatggtgactctcgcacagctga 
gtttctgaagaaacatctctttgagaatctaatgaaacatccagagttcccaacgaacgcca 
agctggccataagtgaaacatatcaacaaacagacatggacttcttagattctgaaaaagat 
accttccgagatgatggttccactgcttcaacagcagttctagttggtaaccatctctatgt 
tgccaatgttggagattcacggactataatatcgaagggcggaaaagcaattgctctttctg 
aggatcataagcccaatcgaactgatgagaggaagagaattgaaagtgccggaggtgttgtg 
atgtgggctggtacctggagagttggtggtgtattagcaatgcacgtgcttttggcaaccgt 
atgttgaagcaatttgttgtggctgaacctgagattcaggatcaagagattgatgaggaatt 
agaactactcgtgcttgccagcgatgggctctgggatgttgtaccaaatgaggatgctattt 
cacttgcacaagcagaagaagaaccagaagcagctgctaggaagctaaca^aaactgcattt 
actcggggtagtgctgacaatattacctgcatagtggtgaagtttcaccacaagaaggttga 
accagaggggagccagcaaggttgaagaatttgttgatgctgcatctgccttttcctggtgg 
aaggctgcttcaatgatgccggtgcaagttgctgacgatagcatcacaggggctgtcatttt 
ttcattcatttctttgcattgtttttccccgtcatcctgtttaactgttgtatttaaggtgt 
ctgcgtttgtgcgtctgctttctccttttctgtagaggtattgtctggataaactttactgt 
gaaacgtagttaaaaggttaaaaaaaaaaaaaaaag 

SEQIDN028 

ccacgcgtccggaagaaatggttgaattcatggaaaaggtcttcaactccctcggctcagaa 
gaactcaccgtggaggaacgaaacctcctctccgtcgcgtacaagaacgtgatcggagcgcg 
tagggcatcgtggcgtattatctcatcgattgagcaaaaggaagagtccagagggaacgagg 
aacacgtaaactctatccgcgagtacagatctaagattgagaatgaactctctaagatctgt 
gatggcattctgaaattgctcgatgcaaagcttatcccttctgcagcatctggtgattctaa 
ggtgttttacctgaaaatgaaaggagattaccaccgctatttggctgagttcaagaccggtg 
ctgaacgtaaggaggctgctgagagtactctcactgcctacaaagctgctcaggatattgca 
actactgagcttgccccaacacatcccatccgacttggactggctcttaacttctctgtgtt 
ttactatgagatcttgaactctcctgaccgcgcttgcaatcttgctaaacaggcctttgatg 
aagcaattgctgagttggacacactgggcgaggagtcttacaaggatagcactttgatcatg 
caacttcttcgtgacaatctcactctctggacatctgatatgcaggatgatggggctgatga 
aatcaaggaagatcccaaacctgatgaagccaaaaattgaaggaattgaaactctctaattt 
gcttttcacttcttcctggttgtttttattggaagaagctgattatcgtaatttcctttact 
attatggttttccgctagggggttgtcttattggaaatgaacaacttttaatattgatgttt 
cagaagttccatctttaatttaatgtggtttttctggtggtaaaaaaaaaaaaaaag 
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SEQIDN029 

ccacgcgtccgctactgtttcttcatcgctatgccgtcagttccgcttcctacttgactgaa 
tctgccgccatggaaggggatatctacacatctaactgcagaattgttacataaacactgat 
gggaatgcagaagagatattctcaaatattcaaggtagcattttgaaaaatgtcagatgata 
tggtcattcattttgcatccaattcttcaaaccaatcagaccagtctctgcccacaaagatt 
gctaaacttgaggcaagaatggtgggcaaagcctcatctacatctacatcccgagctacttc 
ctggtctgccccagccaagtttgggcttgggcctgggcctgctgacaatgttgctgagcttg 
ctgtctctagtgattctgatgatgatgatgataatggaagggaatttctcatacaagcaaac 
actcagaagcggcgcaaactcgaggatgacaacagctcaacttcatttgaacatgtggagac 
agcagctgatactgtgaaaaagatagtagacaatacagacacaagcaaagtgggttcagatg 
tgaatagacggaaacaaagccgtgtcaagggacaaactaattctggtagaggacgtggttcc 
cgagttagtgatcagaccaagtcacaagcagtttctgtgtcaaatggtcagctcgagaactc 
ttaccagaaggatggtttgccaaaagagcaaattgggcacgatcgacagactgtattcgaag 
aggagatcacttctttacgggcaaaagttgtggctttggaggaagagcttaagaaatcccgt 
caagaggcatcagattatcaacatcagtgtcaacagctggaaaaggaattgaaggatcttaa 
agattatgagcagcagacaaagccaaagagaacgaaaataatatctgaattgttaatatctg 
tttcaaaagctgagaggcaagaggcacgaatgaaagtgcgacaggaatctttgagactgggc 
aacgtgggagtaatcagagctggaaccattatttctgaggcctgggaagatgggcaagcact 
aaaggacctcaatgctcagcttagaaacttattagaaactaaagaagctattgaacggcagc 
gtaaattgctcaagaaacgacaaccagataaaagtgatggaggagatgtggagggaggtttg 
caggaagaagattctct cat tcaggat gaga tctacaaatctcgtttagccagcatcaaacg 
tgaggaagatgtgataatgcgtgagagggaccgatatgaactagagaaaggaaggctaattc 
gtgaaatgaaacgcatacgtgatgaagatggttctcattttaacaattttcagattttgaac 
caccgatatgccctcttaaaccttcttggaaaaggaggatttagtgaggtgtacaaggcttt 
cgacttggtagaccatagatatgttgcatgtaagctacatggactaaacgctcagtggagtg 
aagagaagaagcaaagttatatacggcatgcaatcagggagtacaacatccacaagactttg 
gtgcaccatcacattgtgcggctttgggacatttttgagatagaccaaaacaccttttgcac 
tatcttggagtactgtagtggaaaggaccttgatgcagttctcaaagcaacacctgtgttgc 
cagaaagagaagcaagaa teat cat tgtgcagatttttcaaggccttgtct act tgaataag 
aagtcacagaagatcatccattatgatttgaagccaggcaatgttttatttgatgagtttgg 
cattgctaaggtcactgattttggccttagcaagatagtggaggatgatgttggatcccagg 
ggatggagctaacatcccagggagctggaacgtactggtatctacctcctgaatgctttgag 
ctaagcaagacacctcttatatcctcaaaggttgatgtctggtcagctggtattttgttgta 
ccaaatgctgtttggcaaacgtccctttgggcatgaccagtcacaagaaagaatactaaggg 
aggacacaattattaaagcaagaaaggttgaattccctacacgaccagctgtctctaatgag 
gcaaaggagttcattcgtcgttgtttaacatataatcaagcagataggccagatgttttaag 
tattgctcaagacccttacttgacatactcaaagaaatgataggaggatgttaatcccaact 
acttggacagagggtattgggacgaggattggtgctcaaaggaattttgtatagttgtaaag 
ccatgtaattttttgtccctgtaccttcgactagagtggggcggctcaaggggagctttgct 
ttaggccccaaaattttgggggcatttgcatctatacccagtttttgggttaacttttaact 
tatatccgcattgcaaaaaaaattgcaagcatacctacttttcgggtaacttcagacattcg 
ggtctgaagtagcaaaaatttatgtctgaagtttgaacttcagaatgttttgcctgaagtgt 
agtaaaacttcagatatttttgcctgaagtttggcctgacttgcaaagtcaatcacgcaaac 
ttcagttcatagtgcaatggcaaacttcagctcaataaaattacagcatgttttggctgaag 
tttttgttttgtaattgttgaacttcagcattttaggaactgaagtttgttttgtaattgct 
gaactttagcattctaggggtgaagtttgttttgtatttgctgaacttcagcattctaggag 
ttgaaatttttgtttatatttgctgaacttcagcattcttagagctgaagttctaagtctgc 
acacggaaatgaggaagataacc 

FIGURE 4 (continued) 



BNSDOCID: <WO 030851 1 5A2_I_> 



WO 03/085115 



10/140 



PCT/EP03/03703 



SEQIDNO30 

ccacgcgtc.cgacgaaatccaaccgtcgaatctcaggcaacaggcggcagctcatttcaccg 

ctgtaacaaaaattcgagagaatggcaatggtagatgagccattgtaccccatagccgtgtt 

aatagatgaacttaagaacgatgatatacaattacggttgaattcaattaggaggttatcga 

ctattgcacgtgcccttggtgaggaaagaactcgaaaggaattgatcccttttttgagtgaa 

aacaatgatgatgatgatgaggtgttattggcaatggctgaagagcttggtgtgtttatccc 

ttatgttggaggtgtagagcatgctcatgttttgctcccgccgttggagacgctttgtactg 

ttgaggagacctgtgtgagggataaagctgttgaatcgttgtgtaggattggatctcagatg 

agggagagtgatttggttgattggttcgtccctcttgtgaagaggctggcagctggtgaatg 

gttcacagctagagtttctgcctgtggactctttcatattgcttactcaagtgccccagaga 

tgttgaaggcagaacttcggtctatttacagtcaattgtgtcaagacgacatgcctatggtg 

cgaagatcagctgccacaaacttggggaagtttgctgctactgttgaatctacttacctcaa 

gagtgacatcatgtcaatatttgatgatcttacacaggatgatcaggattctgtacgcttat 

tagctgttgagggctgtgctgcacttggcaagctgttggagccccaggattgtgttgcacac 

atcctgcctgtcattgtcaacttctctcaggacaagtcttggcgcgtccgctacatggttgc 

taaccagtt'gtacgaactatgtgaagctgtagggcctgagcccactaggacggatttggtgc 

ctgcctatgtccgtttgcttcgagataatgaagctgaagttcgcatagctgctgcagggaaa 

gtcaccaaattctgtcggattcttagtcccgagcttgctattcagcatattcttccctgtgt 

gaaggaattatcatcagactcttcacagcatgtcagatctgctttggcttctgttataatgg 

ggatggctcctgttttgggaaaggatgcaaccattgagcatcttcttccaatatttctttcc 

cttctgaaggacgagtttcctgatgtgcgcctgaacatcattagcaagcttgatcaagtcaa 

tcaggtgattggaattgatttattatcccaatctttgttgccagctattgttgagctagcag 

aggacaggcattggcgagtccgtcttgcaataatagaatacatacctctattggcaagtcaa 

ttgggcataggattttttgatgataagcttggtgccctttgtatgcaatggttacaggacaa 

ggtttattcaatcagagatgctgctgctaataacctaaagcgtcttgcagaagaatttggtc 

cagagtgggcaatgcagcatataattcctcaggtcttggatatgactaccagtccacattat 

ttgtatagaatgacaattcttagagcaatttcattgcttgcacctgtaatgggctctgaaat 

aacttgttctaaattgctgcctgtggttattactgcaacaaaggatagagtgcccaacatta 

aatttaatgtggcaaaggtgttgcaatcccttatacctattgttgaccactcggtggtggag 

aaaaccattcgccctagtttagtagagctagctgaagaccctgatgttgatgttcgctttta 

tgccaatcaagcacttcagtcaattgataacgtcatgatgtcaggctagagaatataactgt 

ggtgagagtactacaaatctctcttcaaatccctctttggtaggattttgctctcacacgaa 

gacgcaaaagagaaaatgtgcaagcaaaatgcattctgttgagcttggagtcgtatattgtt 

actaattcttttgtaggatttgacattcaagatgctgtgacactaatgaacaccgagtgttt 

tttcatgtaaagttactgccgtactatttagatctgctaagctcatgtatttgcttttgtta 

gtgtacttttttggtgtttgaacttacaactttttacctgcgttattctagcagatttgttg 

cgtttacattagcgtttgcgtttcttcctagccgatgttatgtttgagcagtgcccccgcca 

ccctctctttttctcaggtcttatgctttctatgtgttttttcatgccgatagaatgtatgt 

ggaacttttagtacttattattttttatgttgtatttgttggcttgagatgagcaacataaa 

taataagaaactggg 
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SEQIDN031 

ctgtacaaaaaagcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtcc 
gaggagattgagctgagctgactcaatgtttccgagattgattcaaccacaaggggaagatg 
aatataatatgaatgttgggattcatcatactcataatattaatggagatccttgccttgtg 
ctgacgtcagatccgaagcctcgacttcgttggactgctgaccttcacgaacgcttcgttga 
tgctgttactcagcttggcggtcccagcaaagctacgccaaaggcgataatgcggacaatgg 
gtgtcaagggactgaccctcttccacctaaagagtcaccttcagaaatacagactaggactt 
acagctacatattcattagagagcccttgttctggtggtactcctcagcagttgccggcatc 
ggacttgaatgaaggttatgaagtcaaggaggcattgagagctcagatggaagtgcaaagta 
aattgcacctgcaagttgaagctgagaagcacttgcaaattcggcaggatgctgaacaaagg 
tatattagcatgctggagaaggcctgtaaaatgcttgctgatcaattcattggtggtgtagt 
tactgaaaatgatcaagagacttgccaaggattaggaacaaggacacaagttagccctcttt 
gtaatccacttggattatgcccctcggaatctgctgatcttgttggaatccatggtccagaa 
gaagtttcccccagaatccatccacaattcaccgattgttccactgaaagctgcttaacttc 
gcatgagagtcctgctggacttcccctagaaggaacttcacctggaggaagaaaacgagggc 
cgaatggagattcaacacatgcatcagttgttggggtgaagcagatatgatatcgtcaggtg 
ttcgtctgcttcaagttaatcgctttgggattactagctctaatgttcaaaatgtctcttct 
taagagattagtgctgagtttatctacagccattgattctcaaactgcatattgcggtttct 
gggaatactgatgggccttggacttgtcaagttgtaaatgcaagctgatgactttctaactc 
taactgcgccccctgaacattaaatcctaaaaaaggaagaaaattgagatgcgag 

SEQIDN032 

agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgaaa'gaagaga 
aaaagatgggtgctgacaaagggaagaagcaaaaagtggaggaagagaacaacaccattgat 
ggtgagctcgttttttccattgaaaaattgcaagaaatacaagacgagctcgagaagatcaa 
tgaggaagcaagtgataaagtattggaagtggaacagaagtacaatgagatccgcaagcctg 
tctatgacaaacgaaacgacatcattaaagctatcccggacttctggttgactgcttttttg 
agtcatcctgtcctaggtgaacttctaactgaagaagaccaaaagatcttcaagtttctaag 
ttctattgaagttgaagactctaaagatgtgaagtcgggctactcgataacctttaacttca 
atgcgaatccttattttgaaaatacaaagctcacaaagacctataccttccttgaagatgga 
cccacaaagatttctgctacaacaataaaatggaaagaaggcatgggcattcctaatggatt 
tgcacatgagaagaaaggaaacaagcgatctcatgctgaggaaagcttcttcacatggttca 
gtgaagtcaatcaaaaagatgaggatgaggatgaggccctagagattcaggatgaggtcgct 
gacataattaaggatgacttgtggccgaaccctctcacctattttaacaacgagcctgatga 
agaagattttgatggtgacgagggaaaggacagtgaaggctctgaagacgaagaggaagaag 
aagaggaggatgaggatggtgatgaagaatgaaggcagtaaactgttcaagacccctatttt 
gggatctcgtcttcagcggttttaatcatcagggtttaatgtctgtaaagaggctttgaatg 
ttgccaaagaacagaataactgtggtgactataccttttcttctcttgtatggttataactt 
ataagcaaaatatctaattccggaggttccaaaatgttttcattaggctagttcgattaatg 
aagtgtttgtctggcaaaaactgataatgttaggttattgagttatg 
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SEQIDN033 

ccacgcgtccgcccacgcgtccgggcagctcatttttaccgccgtaacaaaaactcgagaga 
atggcaatggtagatgagccattgtacccaatagccgtgttaatagacgaacttaagaacga 
cgatatacaattgaggttgaattcaattaggaggttatcgactattgcacgtgcactcggtg 
aggaaagaactcgaaaggaattgatcccctttttgagtgaaaacaatgatgatgatgatgag 
gtgttattggcaatggctgaagagcttggtgtgtttattccttatgttggaggtgtagagca 
tgctcatgtcttgctcccgcctttggagacgctttgtactgttgaggagacttgtgtgaggg 
ataaggcggtggaatcgttttgtagaattggatctcagatgagggagagtgatttggttgat 
tggtttgtccctctcgtgaagaggcttgcagccggtgaatggttcactgctagggtttcagc 
ttgtggactctttcatattgcttactcaagtgccccagagatgttgaaggcagaacttcggt 
cgatttacagtcaattgtgtcaagacgacatgcctatggtgcgaaggtcggctgcgacaaac 
ttggggaagtttgctgctaccgttgaatctgcttacctcaagagtgatatcatgtcaatatt 
tgatgatcttacacaggatgatcacgattctgtacgcttattagctgttgagggctgtgctg 
cacttggcaagctgctggaaccacaggactgtgtggcacatatcctgcctgtcattgtcaac 
ttctctcaggacaagtcttggcgcgtgcgatacatggttgctaaccagttgtatgaactatg 
tgaagctgtagggcctgagcccactaggacggatttggtgcctgcctatgtccgtttgcttc 
gagataatgaagctgaagttcgcatagctgctgcaggaaaagtcaccaaattctgtcggatt 
cttagtcccgagctagctattcagcatattcttccctgtgtgaaggaattatcatcagactc 
ttcacagcatgtcagatctgctttggcttctgttataatggggatggctcctgttttgggaa 
aggatgcaaccattgaacatcttcttccaatatttctttcccttctgaaggacgagtttcct 
gatgtgcgcttgaacatcattagcaagcttgatcaagtcaatcaggtgattgggattgattt 
attatcccaatctctattaccagctattgttgagctggcagaggacaggcattggcgagtcc 
gtcttgcaataatagaatacatacccctgttggcaagtcaattgggcataggattttttgat 
gataagcttggtgctctttgtatgcaatggttacaggacaaggtttattcaatcagagatgc 
tgctgctaataacttaaagcgtcttgcagaggaatttggtccagagtgggcaatgcagcata 
taattcctcaggtcttagatatgactaccagtcctcattatttatatcgaatgactattctt 
agagcaatttcattgcttgcacctgtgatgggctctgagataacttgttccaagttgctgcc 
tgtggttattcatgctacaaaggatagagtgcccaacattaaatttaatgtggcaaaggtgt 
tgcaatcccttatacctattgttgaccactcggtggtggagaaaaccattcgccctagttta 
gtagagctagctgaagaccctgatgttgatgttcgcttttatgccaatcaagcacttcagtc 
aattgataacgtcatgatgtcaggctagagaatataactttggtgagagtactagaaatctc 
tcctcaaatcctctttgatagtcttgggattttgctctcacacgaagacacaagggaaaatg 
tgcaagcaaaatgcattctgttgagcttggagtcgtatattgttactaattcttttgtagga 
tttgacattcaagatgctgtgacactaatgaagaccgagtgtttttaaatgtaaagttgctt 
ctgcactatttagatctgctaagctcatgtatttgtttttgttagtgtacttttttggtgtt 
tgaacttcccacgttttctgcg 
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tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 
ccgagaaattagcagttagagacactgagaagcagcagctctcttcctcagctgctgtgtgc 
ttaggcaaagaataaaatgggggcagacaaagggaagaagcagaaagtggatgaggaaaaca 
acaatgttattgatgaaaagctcattttttccattgaaaaattgcaagagatacaagacgag 
ctcgagaagatcaatgaaaaagcaagcgacgaagtgttggaagtagaacagaagtacaacga 
gatccgcaagcctgtctacgataagcgaaatgatgtcattagctctatttctgacttctggt 
tgactgcttttttgagtcatcctgttcttggtaaccttctcactgaagaggaccaaaagatt 
ttcaaatttgtaagttctattgaagtggaagactcaaaggatgtgaaatcgggtcattcaat 
cacgtttaactttaagcccaatccttattttgaaaattcaaagctctcaaagacgtatacct 
tccttgaagatggacctacaaaaattacagctacaacaataaaatggaaagaaggcatgggc 
attcctaatggagttgctgacaagaagaaaggaaacaagcggtcccacgctgaagaaagttt 
ctttacatggttcagtgaagtcaatcaaaaaggtgatgtggatgatgacgaaaatgagattc 
tggacattcaggatgatgaggttgctgaaataatcaaggatgacttgtggcctaaccctctc 
aattatttt^accatgagcctgatgaagaagatattgagggcgatgagggaaaggacagcgg 
aggctctgaagaggaagaagaagaggaagatgatgaagatgaagaagacgaatgaactgttg 
gtagaccttgtgtttgatttgagttctcatcagtgtttcaatcatcagagttggtctctgta 
aagaggtttcggatattgcagaaaaattgaatgacatatagtggtgactctaatttttagtt 
tcagtga 
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ccacgcgtccgatcttgaaaaagttccattcttttttctccttctgcttcttcttctgattg 

aagattctgaacctgttctaagtttatggataggtggaatacttcactgagtggatattaca 

actacccttttcgattcttgcatttttattccatttttgttgtgattgtgttttcttccatt 

ttccctattatatcagctgggaggaggtcagatggggttattgt'aactcaagctgattttca 

agcacttaaggccattaaacatgaactgattgattttagaggaatcttgacaagttggaatg 

atagtggtttaggagcttgtgctggtggatggataggtataaagtgtgttaatggggaagtt 

atagctatacagttgccttggaagggattaggtggcacaatctctgaaaaaattggtcaatt 

acaagctcttagaaagcttagtattcatgacaatgttattgctggtcttgttccaacttcgt 

tgagtttccttccaaatcttagaggtgtttatcttttcaataaccggctttcgggttcaatc 

ccaccaaccattggcagatctccacttcttcagactcttgatcttagcaacaatcagctcac 

tggtactatccctcctagtcttgcaaattccacaaggttatacagactcaacttgagctaca 

atgcactttcaggttcaatcccagtaagttttactcaatccccttctcttacttttcttgca 

cttgaacataacaatctttctggctctattcctgatacttggggtaatgttgttgtgaacga 

taagtcttatcaacttcagtatcttacccttgatcacaatcttttatatgggaaaattccag 

cttcaattagcaagttaagtatgcttgaggagattaatcttagtcataaccaaattaatggg 

actattcctgatgaattaagtgcacttacaaggcttgctcttcttgatttatctaataattc 

cataaatggaactattcctgttagtttctccaatctttcagctcttgttactttgaatttaa 

agagcaatcttttggataaccaaatcccagatgtcatatatagattgcaaaatctttcagtg 

ttggatttgagtaagaataagctcactggccatattcctgccaccattgggaatatttctag 

gctcaactcacttgatttatctgaaaacaatttcactggtgaaatcccaaactctcttgttt 

ctttggcaaatttgacttctttggatgtctcttataacaatctttctggggttgtcccatct 

cttctttctaagaagttcaattcaagtgcttttgttggaaatctagagctatgtggatatag 

tccctcaactctatgtgcttcaccacctcctcaaactcttcctccttctcctattggtgggg 

ttgccaagcctcgccatcgcaaacttagtactaaggatatcattctcatagcatctggagca 

cttctagttgttctacttcttttgtgttgcatgctactttgctgcttgattaggaaaaaggc 

aaattcgaaagcaaaaaatggtagtaaagccagtggcttagctaccacagggagaggtgcaa 

agccggttccagcagcagcagcaggtgctgaagttgaatcaacaggtggaaagctagtccat 

ttcgatggaccattcgtgttcacagcggacgacttgttatgtgccactgcagagataatggg 

aaagaacacttatggaacagcatataaggctacattagaggatggtaatcaagttgctgtga 

agaggctgcgcgagaagatcacaaaagggcaaaaagagtttgaagctgaagttgctgaatta 

ggcaagattcgacacccaaatattttggctcttagagcctattacttgggacctaaaggaga 

gaagcttcttgtctatgactacatgcctaatggaagtctctcatccttcctccatgctcgag 

gtcctgagacaacaatagactggcctacaaggatgaggattgctattggcataacaaaaggc 

atctgctttttgcataccaaagaaaacataatacatgggaatcttacatcaagcaacatact 

acttgatgagctaaacaacccaaagattgcagatgtaggcctttctaggcttatgacaagtg 

ctggaaacaccaatgtgattgccactgcaggcacactaggttatcgtgcaccagagctttca 

aagatcaagaatgcaagtactaagaccgatgtctatagtgttggagttatcattttggagct 

cttaactggaaagtcacctagcgaggcaacagatggactcgatttgccacagtgggttgctt 

ccattgtaaaagaggagtggactaatgaagtgtttgatgtggaacttatgagggatgcccct 

aatattggtgatgaattgcttaatactttgaaactagctttgcattgtgttgatccaacacc 

aactgctcggcctgaagctgagcaagtacttcagaaattggaggagattaaaccagagctga 

tgttagcaccccccagttctggaaatgatggcgctgcagttcaagaaaaaaatgaataaact 

cagtaaggtttgattgctaaaagtgtattgaaaaaggtttaggagttccagcttttttactt 

gattgacacccacctatttattctttcatttttttttttgatccagtggagtgagttgttgt 

ctcctattagttctattagtaaactgtatatccgagcttctgattgctgcatagatgcaaaa 

cgcattttgttcaattccctctattctttgcaatgtaatgcaataatagtatctatcttttt 

gatgacatcaacacacgccacgtg 

FIGURE 4 (continued) 
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SEQIDN036 

ccacgcgtccgccgtgatgtaatcttggtgatgctacttattcccttttcccttcttgagcc 

caaactcaagaaggtcaaaaacaaaaaaattacaaaaagctggaatcttgcagtttttttat 

ttaatttatttatcctatgttgaattaatttttggggtcaatatttcccaatttgtagtctc 

caatggagcctcgtgttggtaataagttccggcttggccggaaaatcggtagcggttctttt 

ggagagatctatctcggcgctaatgttcaaactaacgaagaggttgcaattaagctggaaaa 

tgtgaaaacaaagcatcctcaactattatacgaagcaaagttgtataaaatactacaaggag 

gaactggaatccccaatttaaaatggtttggagttgaaggagattataatgcccttgtgatg 

gatttgctggggcctagtcttgaagatctcttcaacttctgcagtaggaagctgtctttaaa 

gaccgttctcatgctcgcagatcagatgattaatcgggttgaatttgttcatgccaaatctt 

ttcttcatcgagatataaaacctgacaactttcttatgggattaggaagacgtgcaaatcag 

gtctatatgattgattttgggctggccaagaagtatagagactcatcaactcatcagcatat 

tccgtatagagaaaacaaaaatttgacaggaactgctagatacgcaagcatgaatactcatc 

ttggcattgaacaaagtcgaagggatgatttggaatcgctgggttatgttttaatgtacttc 

ttaagaggaagtctcccttggcaggggctgaaagcaggcactaagaaacagaagtatgagaa 

gatcagtgagaagaaagtatcaacatcaatagagaccttgtgtaggggatatcctgcagagt 

ttgcatcatatti:tcattactgtcgatcactaagatttgatgataaaccagattatgcttat 

ctgaagagaattttccgtgatcttttcattcgtgaagggtttcaatttgattatatatttga 

ctggaccattttgaaatatcagcaatcacagcttgccaatcctccatctcgtgctcttggtg 

gtactgctgggccaagctcagggatgcctcatgctcttgttaatgttgagaggcaatcaggt 

ggagatgaaggtcgaccaactggttggtcttcatcaaatcttacacgtaataagagcacggg 

gctgcatttcaattctggaagcttattgaagcaaaaaggcacagttgctaatgatttatcca 

tgggtaaagagttatccagttctaattttttccggtcaagtggaccattgaggcgtccagtt 

gtctctagcatccgagacccagtgattgcagggggtgaacctgacccctccggcactctgac 

aaaagatgcaagcccgggaccattgcgtaaagtatccagtgctgcacggaggagttcaccag 

ttgtgtcctcagatcacaagcgcagctcctctatcaaaaatgccaacataaagaatttagag 

tccaccgtcaagggaatagagggtttaagttttcgatgatgagggactgcattagtagctgt 

gctttgtctcagttctccgttcactgtaaattttggcacaccaacttggggagtaagagttc 

tgatattagttgctgtcaggaagtaccataaagctgaattatacaattaaaatttgggatcc 

aatcgcaaaagcacattaaggatatgatggggttgcagatccaaactcacagattccagttt 

atgctcgtccatacagttataggcactttccatattcttttctttaatctctgtctcttgct 

tgttattgttatgtcgtggtattcttgttgaggtcatgtttgtgaattgcgaagatggtcat 

gtataattgccgagaaatcatgtactagtttgttttaaacatgagcaaactgttattttgtt 

caagctactttaatatcaaaaaaaaaaaaaaaag 

SEQIDN037 

Ccacgcgtccgcccacgcgtccggaagaagaagcctgctgccatggcttccgagaaagaagc 
tgctcttctcaccgttccttcagattctcctaccttatttgacaagatcattaacaaggaaa 
tcccagcaaacattgtctacgaggatgacaaggttttagctttcagagacataaatccccaa 
gctccggtgcacattctgcttattcccaaggtcagggatggcttgactggactgtccaaggc 
tgaagaaaagcattgtgaaattcttggtcaacttctttacaccgcaaagcttgttgctaaac 
aagaaggtctgctcgagaatgggttcagacttgtgatcaatgatgggcctagtggatgccaa 
tctgtttatcatcttcaccttcaccttctcgggggacgacagatgaactggccacccggcta 
aaggaagccgagatgaattccagatctcatggagtatccagacttcatccgatcatctatgt 
gtagcacttactgaaaacactatcgtctatgtgtagcgtttgaagaatcaagctctaagctc 
gtcctatgctcctatggagtgacaaataggactcattccgactattatattgatcatcaata 
agagggatttctctgaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
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ccacgcgtccgcaatattttgactaatacactgttctgttcttcacctaattcttttcttct 
tctaataataacgtgctgctaagtcctaaagctcctctttggagctccaattaactccaaac 
taccaaaatccagagcaggtttaacaaagaatgggggttggaggacgtgaagtggcgatttc 
attggatggagtgagggacaagaatatgatgcaattgaagaaaatcaatactgcaattttcc 
cagttcgctacaacgataagtattacactgatgccattgcctctggtgatttcaccaagcta 
gcatattacagtgatatttgtgtaggttcaattgcatgtcgccttgagaagaaggaagctgg 
ggctgttcgtgtttacataatgactctgggtgttttggctccatatcgtgggctaggtattg 
gcaaaatgttgttgaaccatatcctcaatctttctgccaagcagaacgttagcgagatttat 
ttgcatgtgcacacaattaacgaagatgccctcaatttctataagaaatttggatttgaggt 
tactgataaaatccaaaattattatacaaacataaccccaccagactgttttgtcctgacca 
agttcatcactcaaacgaagaaatagatagtctcagctactttgattgagccttggtcaaac 
cttcacattatctttgaggttctgagctttctggttctagtttttgctacttatgagtaatg 
tgccacccattggattgttagtgtaagcccttttctgttctatcttatcctatctgcaacaa 
catcaaagttgaatgatttccctgtaatagaaatagtgcagttcaatgcaaacattcgagtt 
tggttatgttagatcacg 

SEQIDN039 

gatatcgtcgacccacgcgtccgaaagagaagaaatattaaatagcacaagaaaaatggaga * 

gtgctaatgcatattctacattgccaatggaaaatgttaacgatgttgggcttattaatttc t 

atggacgaggctaactttgaacaatttattgagctcattaagggtgaaactgctgaccctat • - s 

cgtgaagttttgccccaactatgactgtgaacacattacaggttgttttccttcgactgatg 

tccaatttgagccaacaccaatggatatctttgattggaatgctacaaacatatctaatcct 

atttcacttttttcttccctccccggagaaatgaagctccgggaagaagaagaggaggagga & 

agacgacaatgattacgaggaatcttctgggacaacaactaccaccaccatgactatgttgc 

cggcaacgccaacaaagaagagcacgaggactgaccgatcaagaactttaatttccgagcga . \ 

aaaagaagaggaagaatgaaagagaagctttatgccttgcgttccttagttcctaatataac ^ 

aaagatggataaagcctccattataggagatgcaatactatatgtacaaggactgcaaacgc 

aagcaaagttactaaaggcagaaatagcaggtcttgagtcttcctcaaatgaaatgaacaat 

aatccatttcagaataccaagcaaatgaaattgatgactcattatcctgcaatcaagaggat 

atcgaagatggacatttttcaagtaggagaaagaagcttttacgtgagattagtatgcaaca 

aagggcgacaagttgctggttctcttttcaaagctcttgagtctctttctggattcaatgtt 

caaagctccaacttggctacttctgccgatgattatattttgacgttcactcttaatgtgag 

cgaatgtgaggtagacatgaacttggccaatttgaagctatggatagctagtgcttttctta 

atcaagggtttgacttcgagacattaccattggcctaacgtttcattattgtaattgtgcag 

agttttaaccggtcaaagaatgagaaatgtcattatttatcggtcgtcatttgtaacttttg 

attatttagagtcacgtattctaaaagagtaaagtttgtcaaattgcaatggcgcgcatcgc 

actgtgtacatgtgaccgacctaattgtttattacggttgactttgttactactacttttgg 

aatcaaaacagtcatggcgggcgcg 
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ccacgcgtccgctttccacattctctcaactttctctttctaaaactcttcctctttttcta 

gcacacagaccttcaatggcatcgccgcgcgaggagaacgtgtacctggcgaagcttgctga 

gcaagccgaacgctacgaggagatggtagagttcatggagaaagtcgtcggcgccggcgacg 

acgaactcaccgtcgaggaacgcaacctcctctccgtcgcgtacaaaaatgtgatcggagcg 

aggcgagcgtcgtggcgcataatctcatcgatcgagcagaaagaagagagtcgcggtaacga 

agatcacgtggcctccattaaaacctacagatctaagatcgaatctgaattgacttcgatct 

gcaatggtatccttaagttgctcgattcaaaactcattggcaccgctgctaccggtgactct 

aaggttttctatttgaaaatgaagggagattattacaggtacttggctgagttcaaaaccgg 

agctgagagaaaagaagccgccgagaatactctttcggcttacaagtcggctcaggatattg 

ctaatgttgaattagcccctacacatccaatccgattggggctagctctcaatttctcagtg 

ttttactatgagatattgaactctcctgaccgtgcttgtaatctcgccaaacaggcatttga 

tgaggctattgcggagcttgacaccctgggagaggagtcctacaaggatagcactttgatta 

tgcaacttcttcgtgataacctcactttgtggacctcggatatgcaggatgatggaactgat 

gagatcaaagaaccatcaaaagcggaggagcagcagtaatgtgagtgaagcctctttgctta 

ggattgaatcctatggcataactttgctcattgatcgaaatttgctgtttgtgtagttctga 

attccctgaattgtaatacctaaaagcactgtttcttgccatttgttgttttcagcaaagat 

tactttttctctcggtatttcccttgtatttggatgctccagtgaaactctcttatttcgtg 
gaaatgaatgcttg 

SEQIDN041 

ccacgcgtccgcccacgcgtccgctccatgtttcatttactttggagttggttgctaaaaca 
gattaaagctagctgctaagctagtactgttagagttttgttaattagaagaaactaaagag 
tcaaaaacagtggatccaaggcatggaaagaggggacttttcatccaatgaaatggaaatgg 
aagagaaagagaataacgataatattgatgatcctcaacttcaagaggagctctataatata 
tactcagctcgatctcagcatgacatgtctgctatggtttctgtcctttctcaagttattgg 
aaacagtaccacccattcttcttctgctaatgctactccattaaccctacctcaatctgctg 
tagctctccaaaaccaatctcaatctattgaggatcaagggaattcgagaagaaaaaggtat 
agaggagtgaggcaaagaccatggggaaagtgggcagccgaaatccgagacccaaagaaagc 
agctagagtatggcttggcacttttgaaactgctgaggctgcagcaattgcctacgatgaag 
cagctctcagattcaaaggcaacaaagccaaactcaacttccctgaaagagttcaaggcaaa 
ttccaataccttactactactactagtcaaaatcatcacttgcctgataatattgttcaaca 
acaatatattccaactagctccaataataatcatcctctcccttgtcaagaacattatccta 
gtttacatcactatgctcagctacttcagagtgacagcaatattactgatttaaacttcggt 
atctcgccaagttataatcagcagttatctgcttcttttgattttgcgcaatcatcatctaa 
cagtacattatcggaattgccagcttcttatgagcagaggcaattacaatcaagttacaagc 
aagaagaagaagttttaatgagattttcatcgcattttggtactacttcaagctcatctgga 
cctcatgaaagtaactgggaagagtttgaagatagaaagtcataagttcattccctagtatt 
aagagatacgaagactgaaagaagttttatgagatttccgtcgcattttatttcgtagttta 
tggttttactggggttttctgtcctctgatcttgtatttcagttaagtgtaatagtagaact 
atatatattcatgaattaatggaaaaatattggtgtggttttatgtgtttaaaaaaaaaaa 
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ccacgcgtccgcccacgcgtccgcccacgcgtccgatttgcttttccctcctcttctcctcc 
attttcctttaatgtcactaaaacagaagaaggggggaacagaaaagagttagaaaatgata 
ggagggaataatagttttgggaagacaatttgctcaatctgttatgaagatctaaatcctat 
tattgaagacctccaatccatttctatttgcggtcacgtttttcacgagatttgtcttcagc 
aatggtttgaatactgtacaaatggaaaaaagaagaattgtccagtttgcaaacaggcttgt 
tcagaacaaaatgcaaataggctttatttccaatcagttggtgatccaaatgatacaagtct 
gaccaagaaaccccgtgatcatgaagaggatccacgtgaactgagaaatgaggtcaaaagat 
tggaggggaaagttttacagttgacttctactttggagaaacagc-tgaaagatctcaaagaa 
gtcaatgcagagcttttcacatgcaaggaagagttgaaaatagaagcgactctaaagaatga 
agctgtgaaacaggaggcagccattcagcagttgttacatcttaaatccaaggagct agate 
gatcaactttggagtgcataaggctaaaagatggaaatatggctctagatagggagcttgca 
gcactcaagttgagttacaaagaactagtgaccaagtgcgatactcatggaaggcgagaggc 
tcgttctcttaggaaacttgagaagtcaaaagaaaagataaataagttgaagaccagggtcc 
aagaacttgagacggcacttgaaagaaaagaaaaagataatgaaaatttgagaactttgaga 
gctgccaagaaaaactttgagttgtatcaaggaagcaaagaacccaaagttgaccgacgttc 
atatgagaatcagaataaggcacctgctgcgacagaagtagatttatgcatagtcactggct 
catgcaatgatttatctagaccaaggagaaaaagaaagtctaagtctaaggaaaagagtata 
caaaacacggcagaagatattataactggtggaagtcaagtgcagggatcagaaaataagga 
tggaatctcaggttcaaggaattcccctgttattattcttgacgatgatactgatcttccgc 
ttctagatgatgttacacagcatcagccctcgtttcgcatcaggaaagagacttctgcacca 
gttatacttgcccatccaggagatacctgtttttctggtggattattaggtcctgatggtac 
ttactggcacttgggaaaatggtgcaagaaggttaaggacaagggatctggatcactgtctt 
gaggactgcaaggatcaggtgtgactgctgttgatttgattgctgtaggagctgacggtaga 
ggcggttggatcaaagttctgcgatcaatgaatccgggatcattgcaggacaaaaataagag 
tgtcatcagtcaagagatacaagtatgacatgaaatcaagtagttcccagtctcaaggatgc 
ttgcatacagatagcttcttcagaagaaccagtggataaccttattaacagtgctgcttcta 
cattaccaactgtagataatagagattaaattccttatacattgtttaggggtttaattttt 
agcaatctagttatactaccatttgattgaatggtccgaaaagaagaaacttaatgtcttct 
tttgagcatgtaaagtagggattcaaaggggaaaggtagcatacaggggagagaaaagcaag 
aaagcaacttcagcaattgtttcttagcggttttcagttatgttgcttgctcaatccatatt 
gaaagtatactcttggtaaaaaaaaaaaaaaag 
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ccacgcgtccgatcaaatcttggcaactatggcttccttattatcggtgccattgtctctat 

catcctcatcaatgcaattaagactatcatcaaggatagactccattacttcaacaaccaag 

ctagtgaaggaggagagccaagtagagctcctcagaaagttttaccaattgagatgtccttg 

gatgagctaaatatactaactgataactttgatgagaaagctctgattggaaagggatctta 

tggctgcgtttttggtgctaaattaagcaa[tgaccaacaagtagcaataaagaaattggata 

ctagttcttcaccagaaccagattccgactttgcagatcagttagcaatggtttcaagactt 

aagcatgagcattttgtgactctaatgggttattgcgtggaagcaaacaatcgaatcttggt 

ttatgagtttgcaaccatgggcacgttgcatgatgtattacatggtagaaagggagtacaag 

gtgctaagcctggtctacttcttacctggaatcagagagttaaaat.tgcttgtggtgtagct 

agtggcctcgaatatctacatgaaaaagttgaacctccaattattcatcacgatgttagatc 

tagcaatgtactactctttgatgatttcacagcaaagattgctgatttcaacttgacaacct 

ctgaatcttcagaggactttggctaccatggtccagagtatgccatggaagaagagataaca 

aagaaaagtgatgtttatagttttggagttattctattggagctgttgacaggaaggaagcc 

aatattagattataaagggcaacagagtcttgttgcatgggcaactcccctattaagtgaag 

ataaagtgaaggagtgtgttgatcccaacctaaataatgactaccctccaaaggcaattgcc 

aaggtggctgctcttgtagaactttgtgttcaatatgaggcagatttccggccaaacatgtc 

aattatggtcagggcactgaagccacttctcaatgcaaattaaactagaacctcaagcatac 

aacattattcttaattgaaacaagctgcagggacagttttaagtaccaacagggctgtgacc 

caagtcatggttccgttgccaatccaaggaaaaaggaaatcttgctaaaggactattggaat 

gttgttttcttaactgatttgtttaagattaaatatattatgttctcattttaaaaaaaaaa 

SEQIDN04 4 

Ccacgcgtccggaatcgacaatgaaattcagccgcgcattcaatgccgcttcagttctctta 
gttcttcttcttattaccatagttacggctaagaagtccggtgatgttacggaattgcagat 
cggtgtgaagtttaagccaaaatcttgcgaacttaaggctcacaagggtgatagagtctcag 
tacactacagtggaaaacttacagatggaactgtatttgactccagctatgagaggaatgac 
cccattgagtttgagcttggaagtggtcaagtgattaaaggttgggatcaaggacttcttgg 
aatgtgcgtgggagagaagcgaaagttgaagatccctgctaaacttggttatggcgagagtg 
gatctccaccaaagattccaggtggtgctacacttgtcttcgacactgagctggttgctgtg 
aatggaaagagatcagcagctgatagtgaactgtgatttaacgatgtctctacactcttcat 
tagcgacttctaaatctatttttaggttatcttatagttatatttgcttctttttgataatt 
tagatactaaagtattggctgctggcaaaatgacacctcaagtgtgtttccttttgtcacta 

gttttttcctctgctaaagttaagtggatggacgatgaactcccaagatggttttgccatca 
ttacttttaaaaaaaaaaaaaaag 
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ccacgcgtccgcgaaccaaaacttccagaacccaatatttacaacctgtcttccgccaccat 
tgacaagaaccttaaaaagcttgcttccaagtaacaataagagaaggttgattggaaaatac 
atttgattatttttattagagaatggcggggactaaggaacagataatggatgttcggtctg 
tggtggaagcagtaacagccgccggagat gat gtt gaga tt gat act cctctt tat gttgtt 
gaaagcctctgtatgcgctgtggtgaaaatggtacaacaaggtttctcttaacactgattcc 
acattttagaaagatattattgtcagcatttgactgcccacattgtagtgaaagaaataatg 
aagtggagttcgctggtgaaattcagcctcgaggatgttgctatggcttgcatattccatca 
ggtgatcaaaagatgctcgaccgaacggttgtcaaatctgaaagtgctaccatcaagatccc 
tgaactggattttgagatcccccccgaggctcagcgtggatcattgtcaacggtagaaggca 
tactggttcgagctgctgacggtttgggggcccttcaagatgaacggaagaaagtggatccc 
cagatggctgaagcaatagatcggttcttgataaaactgagagcttgtgcttcaggagattc 
atcctttactttcattcttgatgatcctgctggtaacagctttattgagaacccgttagctc 
catctcctgatccctcattgaaaatcacattctatgagcgaactcctgagcaacaggcagct 
ttagggtatcttgccgacccatcacagcttggaggacaaagtgatgaggtatcaagtgaggg 
tataaataatgttcctcatcacctgctaaaggaaccacatggatcagttggagcaagagcag 
gacgtcaggctattgctcagggtaacagtgcagaaatagctgaagctctatttcgatattca 
gctcctgaagaggtgatgatgtttccatcaacttgtggagcatgcgctgcgaggtgtgactg 
tagaatgtttgttaccaatattccatactttcaagaagtaatagttatggCatcctcttgtg 
atgcttgtggttatcgcaactctgagctgaaacctggtggtcctatatctgataagggaaag 
aaaattacccttcatgtggaaaacattgaagacttaagccgtgatgtgattaagtctgatag 
tgctggagtggaaattcctgagcttgagttagagcttgctagtggcactttgggtggaatgg 
tgacgacagttgagggtttaatcacaaaaattaacgaaagtcttgagagagtacatggattc 
acatttggagacagtcttgatgaagacaggaagagcaagtggttggacttccgagcaagact 
agacaagcttttgagcttgggacaaccgtggacattgatcatcgacgatgcactttcaaatt 
cttttgttgcacctgcaaccgatgatatcaaggatgacaaacagttaacatttgaagattac 
gtaagatcgtgggagcaaaatgaggagctgggtcttaatgacatggacaccacctcagctga 
tgctgcttacagttcagcagatgctgcacccagtgagaaagctgacgattgatgaatttatg 
cttagtgattttctttcatactgctttggccttaaaatctaaggtaagcgttgattgttctt 
tcatatgactgtagaagagatctagaaccataaaagattgccaacgcctgcagccatgtcta 
catagtggccttgtgactagaactcctttaaatagagagacaaacattttaattagctatac 
gggttcctttaatcaaacacttcagagttattaacaatgcaatttgttttagaagatagttt 
gcaatgcaatgatttttgacttgtaaaaaatatcaatc 
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ccacgcgtccggtggatttgtgtgcagcaccaggcagctggagtcaggttttaagccggaag 
ttatatctcccagcaaagttgtcatctgataccaaggacggcgatctaccacttattgtggc 
tattgacttacagcctatggctcccattgaaggtgttattcaagtacagggtgatataacaa 
atgctaaaacagctgaagtggttattagacattttgacggatgcaaggctgacattgttgtc 
tgtgatggtgcacctgatgtaacgggacttcatgacatggatgaatttgttcagtcccagct 
gatattggcgggcctaaccattgtcactcacatactaaaaggaggcggaaagttcatagcaa 
aaatttttcgaggaaaagacacaagccttctttactgtcagctaaaactatttttcacagaa 
gtgacttttgctaagccaaaaagcagtcggaattctagcatagaggcatttgcagtttgcga 
gaattactctccacctgaaggatttaatgagaaagatcttcatcgccttcttgaaaagattg 
gaagtccatctggcacagaggacctagattgcagtagtgcatggctggaaggtcctaataag 
gtgt a tattccatttctggcttgtggagaccttagcgggtacgattcagaccgtt cat at cc 
acttcctaaagctgcagatggaacctatcagtgtttagatcctgtacaacctccaattgcac 
cgccatataaacgagctcttgaaatgaagaaagcgtcgaatcaaggaatccaaaacctagac 
aagctttctcttagctcctaatcttaccatccagaattattccattctgtgacattggaaag 
tcgcttatacgtcaccaaatgtaaggactttttattgttaactgcacttgcaattaatgaat 
ttaatgtgttttgttaaaaaaaaaaaaaaagg 

SEQIDN047 

ccacgcgtccgtgcagaaatggcgactcgttatctgactcgttccttattcactgctctctc 
tcgctcatacacttctctttctctctctacacctcctccctctctctcttctttttctctcc 
tccgtcttcgaccgcttattgccgtcacagctgtcaacctccgcagcgtctctccggcggta 
gcaacctcctttcgagggtttgcgactcggcaaacgtcgtcgtctttaaatgacccgaaccc 
gaactggtccaaccgtcccccgaaggagacgatcctacttgatggatgtgattttgagcact 
ggcttgttgtgatggagaaacctgagggtgatcctaccagagatgaaatcatcgatagctac 
atcaaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgt 
ctcgacaagacattactatgcatttggagctcttgtatccgaagaacttcattacaagctaa 
aagaactgcccagggttagctgggtgcttcctgattccttcctggatgttaagaataaagat 
tacggaggggaaccttttatcaatgggcaggctgtaccatacgacccaaagtaccacgagga 
gtgggtaagaaacaatgcccgagctaatgagaggaacaggcgcaatgaccgacctcgtaact 
ttgataggtccagaaattttgagagaagaagagagatgcagaacactggatccaacatgggt 
ggtggacctcccaatatgacgaatgcgccaaccccaaacatgggtgggatgcagcagcctaa 
catgggtgggaggcatcagcctagcatgggaggaccacagcagcccaatatgggaggtgcac 
ctcataactacggtggagcgcctcccaataactatggtggagcgccaaacaatccaaataat 
tttcaatacaatagtggacaaagcaacggaggcatgccttaccaaacaggtccaggaccaaa 
ccagaattatgcttcaaatacatctggtggaaacccttatcagaatccaaacatgcctggaa 
gagatatgccccctccaaatcagaactatgctccgaatacggctggtggaaacccttatcag 
aatcaaaacatgcctggaagagatatgccccctcgaaattatcaataggctgatatagataa 
gtatgaactttgtatttccagagttctgtttcacgaaatgagaacatagctatggtgtgctt 
gataggatgttgctgcgtgtaatagttgaatgtgcaaaacttatatgctttgtgagtatgca 
atgtcaaggtgttctcatcctattgcatcctctatgttgacatgctctctgtcaattctcct 
gatgagtttactagcctgaccaagaatatgttatgctttaccatgttgaatgcttgaaattt 
cagggcctcattgcaggtactgttcaaaaaaaaaaaaagg 
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ccacgcgtccggaaatggcgactcggtatctgactcgttccttattcactgctctctctcgc 
ccatacacttgtctttctctctctacacctcctccggtctatcttttctttctctctcctcc 
gtcttcgaccacttatcgccggcgccgctgtcaacctccgcagcgtctctccggcgggagca 
acctcctttcgagggtttgcgactcggcaaacgtcggcgtcgtt'aaatgacccgaacccgaa 
ctggtccaatcggcccccgaaggagacgaccctacttgatggatgtgattttgagcactggc 
ttgttgtgatggagaaacctgagggcgatcctaccagagatgaaatcatcgatagctacatc 
aaaactctggctaaagttgttggaagcgaggaagaagcaagaatgaagatatactctgtctc 
gacaaggcattactatgcatttggagctcttgtatccgaagaactttcttacaagctaaaag 
aagtgccgaaggttagctgggtgcttcctgattcctacctggatgttaagaataaagattat 
ggaggggagccttttatcaatgggcaggctgtaccatacgacccaaagtaccatgaggagtg 
ggtaagaaacaatgcccgagctaatgagaagaacaggcgcaatgaccgacctcgtaactttg 
ataggtccagaaattttgagagaagaagagagatgcagaacactggacccaacatgggtggt 
ggacctcccaacatgacgaatgcgccacccccaa-acatgggtgggatgcagcagcctaacat 
gggcgggaggcatcagcctagcatgggaggaccacagcagcccaacatgggaggtgcacctc 
ataactatggtggagcgcctcccaataactatggtgggtcgcctcccaataactacggtgga 
gcacctcccaattactatggtggagcgccaaacaatccaaccaattttcaatacaatggtgg 
accaaccaacggaggcatgccttaccaaacaagtccagggccaaatcagaattatgcttcaa 
atacatctggtggaaacccttatcagaatcaaagcatgcctggaagagatgtgccccctcca 
aatcagaactatgctccgaatacggctgacagaaccccttatcagaaccaaaacatgcctgg 
aagagatatgccccctcaaaattatcaataggccattgtatatgagtatgaactttgtattt 
ccagaattctatttcacgaaatagtaacagttgtagccgtc 

SEQIDN04 9 

ccacgcgtccgcaaaaccctaaactcttcaccttcaaacatcaaaatcctctcgcattctct 
ctagtaatggctaccgctaactcctcttctttctcacctgtatcttccccttcaaaccatgt 
tcccctaaagcgagtaggtactcacaatggtagcttccattgagatgaagctcttggttgct 
tcatgattcgtcttacaaacaagttttacaatgctcagattgtccgtactcgcgatacccag 
gtgttggaaacgcttgatgcgggtgcttgatgttggtggggtttatgatcctagtcgagacc 
gttatgatcatcaccaaaagggatttcaagaggtttttggacatggtttcactactaagctt 
agcagtgctggtcttgtttacaagcattttggaaaggagataattgcaaaggagctccaagt 
tgatgaagaacatccggatgttcataggttgttccttgccatttacaagagcttcatggagg 
caattgatgcagtcgacaatggaatcaatcagtacgatacagaccagtcacccagatatgta 
aataatactcatttgtcctcacgagttggaagactaaacttggactggattgaacctgatca 
gtcttctgaaaaggagaatgaagctttcgaacgtgcaatggatttagctggcagtgagttct 
t ggat cgcg tccgcttt cat gtaagatcttggttaccagcacgctcaa teat cat ggagtgc 
cttgctgcaagacacaagattgatcctagtggagagattgtagtttttactacattttgccc 
gtggaagcttcatttgtttgagctggaagaggagatgaagattgatcctcccatcaaatatg 
ctttatatcaggatgataggagcaaaagttggcgagtgcaagctgtgggtgtagctcctgac 
agatttgagagcaggaaagcccttccagctcagtggcgaggtttaagagatgatgaactctc 
caaggaaacaggaattcctggctgtgtttttatcc£catgagtgggtttattggaggaaatc 
aaagttatgaaggagcactcgcaatggcaaaagctgctttgaagctctaggcacaggaacag 
ttttataaatggatttcagaaactgagtgatctctttatgatttaacattatagctgatcat 
gacatcaggttgccatttaaatagcgcattggagttgaatttattcaaggttattaaggaaa 
ctatacacaaccaggcagacagttttttacatattcagatgctatcttttacttttac 
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agcggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgtcttcttctt 
cttcttcttcttcttcttcttcttcttcttcttcttcttcttcttcaatttttctctctctc 
tttttctctagggtatacagaaatggggatcgcaacggagaatcaaccacagcaacaacaca 
aggcgtcaccagaggcatcatcagaggcagataaaaagaggtggatgcttaatgattttgac 
attgggaagcctcttggaagaggaaagtttggtcatgtatatctagctagggaaaaaaggag 
caatcacgttgtcgcattaaaagtgctgttcaagagccagctaaagcagtcccaggtcgaac 
atcagcttcgtcgtgaagttgagatacaaagccaccttcgtcatccaaatattttgaggctt 
tatggttacttttatgaccagaaacgtgtgtatttgatcctggaatatgctgccaagggtga 
actctacaaggagctgcagaaatgcaaatattttagtgaacggcgtgctgcaacttatgttg 
catccttagcccgagccctaatatactgtcatgggaagcacgtaatacacagagatatcaag 
ccggagaatcttttggttggtgcacagggtgaactcaaaattgcagactttgggtggtcagt 
gcatacctttaatcggaggcggactatgtgtggcactctagactatttgccaccagagatgg 
tggaaagtgtggagcatgacgcaagtgtggatatttggagcctgggtatcctctgctttgag 
tttctgtatgggatgcctccatttgaagcaaaggaacactcagacacatatcgaaggattgt 
gcaagtggatctcaaatttcctgccaaaccaattgtctcatcagctgccaaggaccttatta 
gtcagatgcttgtaaaggattcttctcagcgtctgcccctaaaaaaggtcctggagcatcct 
tggattgtgcagaatgcagatccttcaggtgtttataagggctgatgaagacatcaccaatg 
actcacaatctttgtggcggactaaattgtttttgtttttcactgaaaaagcctttgctcag 
cgtta 

SEQIDN051 

gtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgac'ccacgcg 
tccgcgggaagattctcatgcaattaaccgaatcgtcaaattttcctctaaaatataaagtt 
tctccggaaaatgtcattcatcgatgaatttcaagccaatatagaagctcttccgaaccatt 
tacggaggaaatatgccttattgcgtgatttagataaaagtctgcaaggagtccagaggcaa 
aatgagcaacgttgtgagaaagaaatagaggatatgatacagcgtattaaggctggtaacgt 
gacaccagactcttcactaatcaaattctctgatgatgcattggatgagcaaaagcatgcaa 
tccgaattgctgatgagaaagttgcattagcttctcaggcatatgatctggtagacgctcac 
attcagcagctcgatcagtacttgaaaaaatttgatgaagagctccggagagaaagagatgt 
tgctgttgttactggaactcctgctaccactgttgaaaataatggaaagtccggaaggtctg 
gtgaaggtaagggagggcgcaagaaaacacgtcttgctacagcagcggcagctacagccact 
gcagcagcagcagcaacaccaagtggaatggatttggatctacctgttgatccaaatgaacc 
aacatattgtttctgcaatcaagttagctatggtgaaatggttgcgtgcgacaatcctaatt 
gcaaaatagagtggttccactacggctgtgttggccttaaagaacagccaaagggaaaatgg 
ttttgcgcggattgtgcaggaacacaaaagaagcggaaaggcagatgatagtagtagaagaa 
aataattcagtatactgatttaagacgttttaccaccggaaaaatttatgtagatactgtac 
ttctgtaattttgttatgtgtagccattattaacaagtcactcttgcattctaattgtagga 
gggaagtacaataagtcaacaaaaaatttactcttgtttattatgaactataacgaacaaat 
aaactattgtcttttaccaatcaacatatttgtaatc 
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Ccacgcgtccgcccacgcgtccgtttcatattcttcttcttctgcttcattgttattgttta 
tagaaaaaaatccaagaatggagcagttgcaagagggttttcgttttcgtcctacagattca 
gaacgacttatgtttttgttgagattcattgctaaacaagagatgaatgattctggatttat 
cacaacaaacatcgacgtctatggcagagaagaaccctgggaaatttacaatcacggcgtat 
cctgtggtaatgaagataatgcggactacagcagtaactatcgctatttcattacaaagctg 
aagaagaaaaacaaggcgaggcataatctagaggttggaaataaagggagttggaaacaaca 
agataagggtaaatcagttcactacaaaaatacgggaaattcatcttctgtggttattggat 
gcaaaaagagcttgtgttacgtgaataaacatcagtgctataatcagagcgatggacattgg 
ctaatgaaggagtacgagctttctaatgttattcttcagaaattcgacgaagattgtagaga 
ttatgttctttgtgccatcaaaaggaagtcatgttctactgattatattgagcggccattgg 
caagggtgcagtatcaagtgaatgatttgggggactatatgcagagcaattcagggcattat 
gtggaatctgaaacggacatgacgacacagaacgaggtgcccgaattagaagttcttgatta 
tcaattagaagttcttgggatgaaaaggacttagctgatttaaattggatgttatatgatat 
gcctgtggtcgatcagacggtgaatattgtcgagcagcagaggaaccagaggtcagttatta 
ataagagtgatgaattctatcagatgttggcacaaaacgaagcttttgagttctattgatta 
actgtatagtcatattcttggtagatgatagagatttgattaacaatggcatatgtcccact 
ttgtagaatggaatttaagatagtagtacatctatatatctttgtataacagtatggcgcgc 
gcc 
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ccacgcgtccgtctatctaagcaatttccgtagttccaaacacggtctaaatcagattcctt 
gcttttttcatctcaaattctctactttttgttgcttcgcaacttggcttagatccttcttc 
atcacgctttgtaactgcttcactcaagctatttcacgatgcgatcaatgcatagatagctg 
aaaattcgaagtgcccggaagaaatggagaaggagtcacatggattaattataggcatctca 
attggagttgtgataggagtgcttttagctatacttgcatttttctgctttaggtaccatag 
gaaacgtcctcagatagggaatagcagttctaggagggctgccactattcctattcgtgcaa 
atggtgctgatacttgtacagtattatcagactcttccattggtacagagtcaccaaaatcg 
actatccagaatggcatgtcagtgtggcttggcggccttaggaaggcaaatgttgtttctgc 
ttccggtatactcgagtactcctacaaggatttgcagagagcaacctacaacttcaccacat 
tgattggtcaaggggcctatggtcctgtttataaggctcagatgtctactggtgagacagtt 
gctgtcaaagtgctcgcaactgattctaaacaaggagagaaagaattccaaacagaggtcat 
gttactgggaaggctacatcatagaaacctggtgaatttggttggatattgtgcagagaagg 
gtcagcatatgcttatctacgtttacatgagcagaggcagtttggcttctcacttgtacgat 
gaaaagcttgaacccttgcactgggatttgagagttcaaattgctcttgatgtggctagggg 
cttagagtatcttcatgacggggcagttcctccagttgtacaccgggatattaaatcattca 
atattttgttggatcagtcaatgagagctagggttgctgattttgggctttcaagggaagaa 
atgatcagtaaacatgtatccaacatccgtggaacattcggatatcttgatcctgaatatat 
atcaactaggtcattcactaagaaaagcgatgtttacagctttggggtcttactgtttgaac 
ttattgctggtagaaatcctcttcaggggctcatcgagtatgttgaactagcagccatgact 
acagatggaaaaggtggatgggaggaaattgcagattcccgtcttgatgggaagtatgattt 
gcaagagcttaatgatgtagctgcacttgcatacaaatgtgtgaatcgtgcccccaagaaac 
ggccttccatgagggacattgtgcaggttctgtcaaggatacttaaatctagacccgacaga 
aagcgtcccaagcgtttttcatctgcaacagcagaggaggttaccatcaatgctgaacaacc 
agattatcggagtccaaactctggaccccgacgaggggaatctatggacagcccagctgact 
catgtgaagtttaacccagttcttccatttgtttatttttttttttttaatttcttcctctt 
cttttcttcttgtaaaattggtcaggttgttaggttctccattcataacacacttctgtctt 
ggtgcgttcgattggggtacttaggatctgttatagtctgcgtgtaagatagcctttctttc 
tttccaattttgttaaatttttgtaaatttgcgtggaaggtaaccgaatggcagaaggaaag 
ggtgaaaagcccagatcagccttttgtcaattctatgaaagttcatatatctttccacaaaa 
gtgcacgg 
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ccacgcgtccgccagcaagcacagtcgtccacaattagatattgatctgaatgtaccagatg 

aaagaacttttgatgaaataaattctcgagattctgctctagagttgatctctccattggac 

catatgactaatcgtgctgcactgaagaatgaagtaattgattctcctgctgttcgctgttc 

tggaggactcgatcttgatttaaatagagttgatgaacctggtgatgtagggcagtgctctg 

tgagtagcagtagtagattggatggtgtagttttaccttccaaaacatcatcatccattggc 

ttgccaactggggaggtgaggagggactttgatttaaataatgggcctggtgttgatgattc 

cagcgcagaacagtttttattccacgataatcatcagggaagcatgcgttcccaactgcctg 

cttctagcctcagactgaacaatccagaaatggggaatctttcttcttggtttactcctggg 

aatacttattcaactgtgacacttccatcaattttgcctgatcgtgtggagcagccgccatt 

cccaatggtcacacctggtgcacaacgaatattgggtcctcctgctgctggttctcctttca 

ccgcggatgtttaccggagttcagtattgtcatcatcgcctgccgtgccttacccatcctcc 

ccttttcagtatcctatatttcctttcggaacgagcttcccacttccttctgcaacattttc 

agttggatcagcttctttcgtagattcttcctctggtgggcggctttatacgccccctgtaa 

attcacagttgctgggtcccgttggcgctgtgtcatctcaatatccaaggccttatatggtt 

ggacttcctgacagtagcagcaacggtaccatggatcacaatagaaaatggggaaggcaggg 

tctggatcttaatgcaggccctggagtggtggacatggaagggagagaagagtcggtttctt 

tgtcggcaaggcaactctctgttgccggttcacaagcattagcagacgagcatggtagaa-tg 

tatgctgtacctgggggtgttctgaagaggaaggagcctgagggtgggtgggacagtgagaa 

cttcagattcaagcagtcatggcactaagatctgcaatctggtgattttataagctactgga 

ggatggacttggctaactcctcaaactctcagcttctggcatgctcctgtgggtgggcggta 

agtgagcaapitttgatgtgttcagagtctccgaccaccacctcttcagcttatcagtgtagt 

tgggatttccatggtttgcaagcactgcactttggtcagctatattctctgggtggatgcag 

atgagttttccctctgtagatatttaactgttggaaagcttgaaatctttgatgcccaggga 

ctggggataaatcaatgttatcctgtccaaattattgacaatggaggtccaatttcgagact 

gaatcaaacggaaagcttttctttgtgctttgctgttaatcatctttcaatgcttcccgtgt 

tcttggcttttctctgtcctcctttgcccattacatatgtatacagggttgacaccaaattt 

tggtactaatgctttcatcaggcatgttttagttgttgtggctgccattgtaccataaatta 

aatcgttctaacgttagtttgtagtctcattcacagatgatagaactcttgttaatgatatt 

ttcaatgatggtggggtgatgtgcttgtttttctttcaagctactaatctgaaccaacagtc 

ttgtgagcaacgaaaagacaacttctgttttctgatttggagaaattaaatgggtggagctt 

ttgcatgggttaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcacaattcttctacagtacaagaaaccaaaaaaatggcgagtcttaaagtt 
ccaacatctgttccagaaccttatgaagatgctgagcaactcaaaaaagcttttgctggatg 
gggtacaaatgaggcacttattattcagattctggcacatagaaatgcagcacaacgcaagt 
taatccgagaaacttatgctgcagcttatggagaggatcttctcaaggacttggatgctgaa 
ctgacaagtgattttcagcgtgcagtgcttctgtggactttgagtcctgctgagcgcgacgc 
ctacttggttaatgaagctaccaaacgtctgacttctagcaattgggttatcttggaaattg 
cttgtacaaggtcttctgatgatctctttaaggcgaggcaggcctaccatgctcgatacaag 
aaatcacttgaagaagatgttgcttatcacacaactggggatttccgtaagcttttggttcc 
tcttttaactgcattcagatacgaaggagaagaggcgaacatgacattggcaagaaaggagg 
caaatatactacacgagaagatctctgacaaggcttacaatgatgaggagctcatccgaatt 
atttctactaggagtaaagcacagctgaatgcaacattcaaccactaccttgaccaacatgg 
cagtgaaatcaacaaggatctggaaactgattctgatgatgagtacctgaaattactcagcg 
cagcaatagaatgcttgaaaaccccagagaaacactttgagaaagttcttcgattggctatc 
aagggtacaggcacagacgaatgggaccttactagagttgtcactactcgggctgaagttga 
catggaacgtatcaaagaagagtaccataagaggaacagtgttccattggaccgtgcaattg 
ctggagacacttcaggagactatgaaaggatgcttctggctttgattgggcatggagatgct 
tgaatggaatatgtgttctaagattggataagaaactatttcctaatgtctgaagtttgaat 
ttgtttgatgatgtgtggcatgtatgcccagagtttggtttgcattatatgggatttaaata 
atccaggtgttgtgttttggtttttaaaaaaaaaaaaaaaa 
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ccacgcgttcgggataacatcattatccttctctcctcttcttccttctttcaaccacaatt 

ctcactcccctctttcgtctctcttctccaacttcaatcccattttcaggcaaaaagctgtc 

atggcttcaatttcagcagcttctgccacagctacagcttctacaaagcttgcataccctta 

ttccccttcttcctcaagcagcagcagcaacactgctgctgtattcccttcaaattcctcaa 

agcttatcctttcctcttcttttacacccaccccttcaacccttttcctccactcaccaaca 

actactccttccaccacccacccccgtcggttcactgtccgcgctgcacgtggcaaattcga 

gcgtaaaaaacctcacgtcaacattggtacaattggccacgttgaccatggaaagaccacac 

tcacagctgctttgaccatggcgcttgcctctatgggcaactccgcccccaagaaatatgac 

gaaattgatgctgcccctgaagaaagggcgcgtggtattactatcaacactgccactgtgga 

atatgaaacggaaaacagacattatgcacacgtggactgcccggggcatgctgattatgtca 

agaacatgattactggtgctgcccaaatggatggggcaattcttgttgtgtcaggtgctgat 

ggcccaatgccacagactaaagagcatattttgttagctaagcaagttggggtccctaatat 

ggttgttttcttgaacaaacaagaccaagttgatgatgaggagttacttgagcttgttgagt 

tggaggtaagagaattattgtcaagttatgagttccctggtgatgaaattcctattatttct 

ggttctgcacttttagctttagaggctttgatggctaatcctagtattaaaaggggtgaaaa 

tcaatgggttgataagatttatcaattgatggataatgttgatgaatatatccctatcccac 

aaagacaaactgaattgcctttcttgatggctattgaggatgttttctcgattaccggtaga^ 

ggtactgtggcgacggggagagtagagagagggactgttaaggttggggaaattgttgatat 

agttggattgaaggatactaggaatactacagtgacaggggttgagatgtttcagaagattt 

tggatgaagcgatggcgggagataatgtgggattgttgttgagaggtattcagaagattgat 

attcagagagggatggtgttggcgaagcccggaacaattactccgcacacaaagtttgaagc 

tttggtgtatgtgttgaagaaggaagagggaggaaggcattccccgttctttgcgggttata 

ggcctcaattttacatgaggacaactgatgtgactggaaaggttactgtgattatgagtgac 

aaaggagaggaatctaagatggtcatgcctggcgatcgtgtaaacatggtggttgagcttat 

catgccggttgcatgtgagcaagggatgaggtttgctatcagggaaggaggaaagactgttg 

gagctggtgttattcagaaaatcttagaatgatgaacttgcagctgagcatctcttttcaca 

tgatcggcactttccattgaagttacttaatccattgtcatatatgcaacttcttggttact 

tttattatgtcttagaatcttactttagtagaagtatcctgttttaaacaccaaattctact 

gaacttttgggatttttcctcagtctcctctttcatttttcctttgcttgaaaggaatgaga 

acatttgatttcatgcactttatttaatttagaacaaatgtgcgactctgtttaaaattaag 
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ccacgcgtccggtttttagctctgtttttgacacctataaaatgcccctctgcttcattgaa 
ctatctccttcctcattctattgacacataggaagaagaggggcgacttgttgtgtaaaaga 
gaaaaaaaaaatgtatgcagagacagggctaatgttcccttattttcagactttcccttctg 
aagttcaacaatttgaagacttctgttcctctcacgaacctaatgcgtcaatgggatccaac 
atatcggaatatgacctcgggggagaaggggatctctttaaagctccacaaccaattattga 
agaaccattgatgggccttgatcctatgactgctgctatttcaatgatttcttgtgcagaag 
atgccatctcgccgcaaggactcaaagtttcggatctagaaacttcgtttgagaatgaacaa 
ctcctgagtgaagttttctatgaatgcaaaaaagacctatttgacaaagatgcaattgatat 
accgttctctgaagtcttggacatgaaaattcctattgtgaaggccgacgaaaacctgactg 
cagatgagaacttggtttctcaagtatctttccagaaaagtattagttcagggtct'ttaacc 
tccatggattggatacacggggcttcaatgaggcccaattttatagattttggtggaatgga 
ctttggagctgtttatggtatgcgaagagcatacagtgaaggagacataaagactctgggta 
atggcaacataaatctgatccattctccactgggtcaaccacagattgtcggatgctccact 
tctgaaattcgcaaggaaaagctctccagataccggagcaagaagaataaaaggaattttgg 
cagaaaaatcaagtatgcttgcaggaaggcattggctgatagtcaaccaaggatccgtggaa 
ggtttgccaagaccgaagaaagctacacatcgaagaagcattaacagttttaactgtctctg 
agttggaagaattatagtaaggtagttcactggttatattagctgatgatgatataaatagc 
aaatggaagctagctttagaacaggatctgctcaaataagttggggatccatccatccaaca 
acttgctagtttgttaaaatctttggggtagcggcaataatctttgtagattagacaaatca 
actagtgttgtatatagtgtttgttaaataaaattctgtag'cttgctattaatgctggataa 
tgtatttccgatatctctatgttcagcggtccagaccgttactctgtatcttactgacgca 
caatttctgtatcttactgacgcaacaattataactatgcttcagtgtatcaag 



a 



SEQIDN058 

ccacgcgtccgctcaataactaaatatatatatagctcagattaatttatcaagaccttgtg 
aagatgaagacttctacttttcttgcaatgttcttggtcttaacgttggttctccaagggga 
atttcaggcgagcgaggcagtgacatgcagtgcctcgcagctaagtgagtgtgtgggggcgg 
tgacgtcgtcacaggcaccatcttcggcatgttgcagcaaaatgagggaccaacagccttgt 
ctgtgtgggtacatgaaggatcccagcctgagacaatatgtcaatagtcctaatgctagaag 
ggttgctaacgcctgtggagttgccgttcccagttgttaaaatacttatgtgtgcaaaagta 
aaagcctttattaattactgttgcttgtactaagggaattataagcctatgttgttggcccc 
ttttacctaaataaaaaaggttgtgatgctaaaaaaaaaaaaaaag 
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ccacgcgtccgctcgcgttttagcagccactggagaaatcaaataggagagagaaggtagtt 
tctagagagagaaaccaaacaaaacaacaccagtttttagagagagaaaaaaagtaaaccgg 
actattctcgaagaaaattttccggtgactgtataaattattttcaagtgaagtttccatat 
ttgtacactcattgtcaattgattgcgttgccgtctccagattctccattaccgatttggta 
attaggttttcgagatcggttggttatcatccttcgattcgttaattcgggttaacaggaat 
tttttggtttcgattcgataatcgggttcaagtatttcagaaagagaacagaaaaaggaaaa 
aggaggtcttaaatctgtttggaagtgaagggggttttggttgaaagatgttgaccaccatg 
gttggtttgtgatgtaatatggcacgggttgttacagataaagatatgtcgttttacattgg 
tcgcgaggcttcaaagttgtggaagagattttgtgcggagataacaacagagatcaatcttc 
ttgctgagaattggaaatatattcttggcggtttgatttgtcagtacatccatggacttgcg 
gctagaggggtgcattactttcatcggcctggaccaattcttcaggacgtcggcttctatct 
tcttccggagcttggacaagatagagcttacataagtgaaactttatttaccaccatctttc 
tatcttttgtcttgtggaccttccatccttttatttttaagaccaaaaagatctatacagtt 
ctgatatggtgcagggtcctggcattcttagtcggttgtcaattccttcggatcataacatt 
ctattctacgcagcttcctggtccaaattatcactgtcgtgagggttcaaagcttgccacgc 
ttcctcctcctgacaatattttagaagtgctattgattgttcctcggggcgtgctttatggt 
tgtggtgatctcatattttcttctcatatgatattctctcttgtctttgtgcggacatacca 
gaaatatggaacacgaaggttcataaaacagtgtgcttggttagctgttattgcacaaagct 
tattgattcttgcatcgcgcaaacattacactgttgatgttgtggtggcatggtacacagtc 
aaccttgtagtgttcttcattgataaaacgttgccagaacttcctgatcgcactagtgcctt 
gttgcttccagtgaccaaggatagcaagtccaaagaagagaatcacaaactgctgaacggaa 
attctggagatcctgcagaatggaggcctcgaaacgggaagatcgtggaagatgggaaaaca 
gtgcacgttgaagcagtaattaatggtgcatagacgataaacttcatgcaacaccactaact 
gatgcttgcgaccttggtacagagattggtaacaatgccattataagttgtgttaatataaa 
tcgttctgggtgttcttccaagttcaatagttttggttttagcgtaggatacgaaatcaaag 
attgagatgctatcgatgtctccacggtcctctgattttatcaaatgtatcatggaatttat 
tttattttttggttaatgcaatatttcccatccg 
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• SEQIDNO60 

ccacgcgtccgatagatatagatacagaagagagagagagagaggtggtgggtgtgaatatg 
gtatagggcctgagacccctgaaagggcatggaagtgctggccatgctgaggcaactcattg 
gacaagttaaacaactcttacaacaacaaaatacacactctccttcttcttcctcctcctct 
tcttcttcttctaacttctcttttcctcttcaatcgccaccgcttttacacctcccaaggtg 
ttatgttctgaatcttgatgacagttctgctgaagacagttgctacaatatcattatgactg 
ctggaaaatctgaaaatctcaagatgttggaa'cctggcaagcctccaccaaaaaagaaagct 
cggaaggagaggaatcgaggaaaagtgactggaacttcatgctccatagagaatttggatcc 
gcaaatatggaaagaatttcctgaagacttatttgaagctgttattgcaagactaccaattg 
ccacttttttccgcttcagatctgtctgccgcaaatggaactcaatgctgatgtcccaaagt 
ttttctgaacagtgcacccaagttcctcaaccacaaccgtggttctacaccat tact cat ga 
aaacgtgaatactggagccatgtacgaccctatgttgaagaaatggcaccatcctactatac 
ctgcactgccgaccaagttgatagtcttgccagttgcttctgcaggaggtcttgtctgtttc 
cttgatattggacataggagcttctacgtatgcaaccctcttactaggtcctttaaagagtt 
accagccagatctgttaaggtgtggtctcgtgtggcagtagggatgacattgagtgggaaat 
cagcttacagtatcctttgggttggttgtgatggtgaatttgaagtttacgactccagaaag 
aactcttggactcgtccaggatctatgtcctcaaatgttaagcttcctatggcactcaactt 
caagtcgcagacagtcaccatcggtaataaattttactttatgcgctcagagcctgatggaa 
tcgtgtcctatgacatggttactgggatctggaagcagttcattatccctgcacccctacat 
ctgagtgatcatacactagcagaatgtgggggccgcataatgcttgtcgggc.tgctgacaaa 
gaatgcagccacttgcgtgtgcatatgggaactgcaaaagatgactcttttgtggaaggagg 
ttgacagaatgccaaatatatggtgcttggagttttatggaaagcacgttcggatgacttgc 
ttgggtaacaaaggtttgctcatgctatctttaagatcaagacaaatgaaccggctagtaac 
gtatgatttctcaaccggagaatggatgaaggtccccggttgcgtgttgccccgtgggagaa 
agaggcaatggatcgcgtgcgggactgcttttcacccccgtcttacagctttggcttaactt 
gggatgcccagtaaatttctagtcacagcagagtgcgatttattatatcatgtggttttagc 
ttttcccatcataatctgcagcctagtgttctctttgctgaatttattaccactctcttgta 
taaacatctagttgttaagcttttcattccagaggactaatctacgactacttattattaca 
ttaaaaaaaaaaaaaaagggcggcc 
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SEQIDN061 

ccacgcgtccgccaatatcagatttctttcatgaactccacttccaatttctcattgcttct 

tcttcccatttccacctccaaagccatccttccagaaaaccttgttccttacatttcttagc 

cccaaaaaagattcccatctcaattccacaaaaaaacacaaggagatctaaggaaattcccc 

gcctctatatatagagaggtggaattgttcctgaatttggtttgaattgattgattgacaga 

ttttggtgagagggtgttattgaaaaaatgggtgacatgaaggataaagtcaaagggttcat 

gaaaaaagtcacatcttcttcttcaggtaagtttaaaggccaaggtagggttttgggtggtt 

catcttcttcaggaccctcaaatcatgtcaataatttttcatcacatcccctaaatacaagg 

caagatcaacaaccttcatatacaaaaacttcgcctcaaaaaccaagtaattctgatcaaag 

aattgagaatatatgtgaaattcagttcaacaaaagtgaatcaaaggatggttttgatccat 

ttggtgaattagtcacttctgggaagagaaacccaaaagggtattcacttactaatgtgttt 

gaatgccctgtctgtggtagtggttttgtttctgaagaagaggtgtcaactcatattgatag 

ctgtttaagttctgaagtgtcttctaatttgggagttgaaagtaaagttgaagttaaaagtg 

aattggaaacatgtgttagtgcatatgtttcagggaagccctcagaagggtcagttgaagtg 

gtcattaagttgttaaagaatattgtgaaggaaccagagaatgccaagtttaggaaaataag 

gatggggaatccaaaaataaaaggtgctataggtgatgttgtaggaggagtggagctattgg 

aatttgttggatttgagttgaaagaagaaggtggggaaatttgggctgtgatggatgttcct 

tctgaagaacaacttgttatgcttaagaatgtagtttcactcttggaaccgaagaaggttga 

agagttggcgtccttatcccaagttaaggcgagtgaaccagttgagccgaagaagattgata 

gacagattcgagtgttcttttctgttcccgagagcgtagcagcaaaaattgagctacctgat 

tccttctttaacctctcacgtgaggaattgagaagagaagcagagatgaggaagaagaaatt 

agaagattccaaattattgattcctaaatcttatcgggaaaagcaggcaaaagctgcaagaa 

agaagtacacaaaatccattatccgtgtacagtttccagatggagcattgcttcaaggtgtc 

tttctaccttcggagccaactagtgctctttatgagtttgtgagcgcagcgttaaaggaacc 

aagcttagagttcgaattgttacatccggtgcttgttaaaaagcgggtgattccccattttc 

cagctgctggggagagggctgtaacagttgaagaggaggatttggttcctgcagctctact.c 

aaatttaaacctatcgaaacagattctgttgtttttactggtctttgtaatgagcttcttga 

aattagcgagcccctcgagaccggatcagttgcttcctcgtaagctctaaattacatcagac 

tttgaattcttctgagtgttggaaaccttataaaactctctgcgccgggaatgct 

SEQIDN062 

ccacgcgtccggactttctgaccttgtcaaaaacctctgtgtttctctcacatttctggtcc 
caatctcttgatatttattggagaagacgatggcagctccaccagctagggctcgagcagat 
tatgattatcttatcaagctcctcctcattggcgatagcggtgtgggaaagagttgtttgct 
tctgaggttctcagatggttccttcacaacaagtttcatcaccactattggaattgacttta 
agataagaacaattgaacttgatggcaagcggattaagttacaaatttgggatacagctggt 
caggagcgtttccgcactatcacgacagcgtattatcgaggagccatgggtattctgctggt 
gtacgatgtcacggacgagtcatctttcaataacatcaggaactggattcgcaacatagagc 
agcatgcttctgacaatgtcaataagattttggttgggaacaaggctgatatggacgaaagc 
aaaagggctgtgccaacttccaagggtcaagctcttgctgatgaatatggcattaagttttt 
tgaaacaagtgcaaagacaaacatgaatgtggaagaagttttcttttcaattgctagggata 
tcaaacaaaggctttcggaatctgattccaagactgagcctcaggcaatcaggatcaaccaa 
tcggatcaggccggaacttctggtcaagctgcacagaagtcatcttgctgtggttcgtgaat 
ggagacaatcgtgtgggaagaacgttcgttagttgcatttggatgtaaaaattgattgggat 
gaaaaactgattcctgttaacttcattaccaaatatttcttcgccatctgatggcaagcttg 
atgtgtcaaaggcttttctactgtcgttgtgaatctattgtcatgcagttaactagcctgcg 

ttttgataaaaaaaaaaaaaaaaaaag 
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SEQIDN063 

ccacgcgtccgaataaatctgcttttggaaacattgtgttgcatccttctcttcagaggaga 
atagaacacctcgctagggccacagcaaacaccaagtctcaccaggcaccatttcgcaatat 
gctcttttatggtcctcctggcactgggaaaacaatggttgctagggagatcgcaagaaaat 
cgggtttggactatgccatgatgactggaggggatgttgcacccctgggtgcacaggctgtc 
accaaaattcacgagatattcgattgggccaaaaaatcaaataaaggcctactgcttttcat 
tgatgaggctgatgcatttttgtgcgagcggaatagtacatacatgagtgaagctcagcgaa 
gtgctttaaatgctttactctttcgaacaggggaccagtcccgagacgtagttcttgtcctt 
gcgaccaacaggccaggagatctagacagtgctgtcactgaccgtatagacgaagttatcga 
attccctctccctcaagaagaagagcgtttcaaattgctgaagctctatttgaacaagtacc 
ttgctggtgaaggagacagtgacagcaattctaagtgggggcacctcttcaagaagaaccaa 
caaaagaggataaccatacaagatttgtctgatgatgtgattagagaggctgctaagaagat 
agaaggattctctggccgtgagattgcaaaacttatggcaagtgttcaagcaactgtatatg 
ggagcccagattgtgttcttgattctcaactgttcaaggaaatcgtagattacaaggtcgct 
gagcatcaccaacgaataaaactagctgctgaaggtatggagccaacttaccaggggaatta 
actgacaccacaaagatacaagtgtctttcactgatacgaattgttgaaaatttgtttatta 
tctctttggtagtattgcatgcaaaattcattttttccaaacttaggatattgtagtttagg 
tgtactatttctgcttggggaatgagcactggatggtggacgtgtttcagggttcaatggga 
cgttacaatttgatgggtacatagctcacttgggctgtaattgtattgattctgtggatcgc 
aggaaaatacatccattgaatagataaatagtaggcaaaacatgaagtctctttgaaatagg 
tctctgttatcaaatatcaactaacctatcttttgattacc 

SEQIDN064 

ccacgcgtccgtatcttaatccgactccatctcctatctatctctcatacacttaacataaa 
tccacaatcaaattccccactataacacacacccaaattataaagagagaaatttttcgttc 
tgtggtgtttattattgtttgtgggttttgtaaataaatg^gggtcagaatcagatgagaggg 
aggtgatattgggtgtagatgggggcaccacctccactgtgtgtgtttgtatgccacttctt 
ctcttttccgaattccctgatcctcttccagttctgggccgctccgttgctggttgttccaa 
ttttaatagcgttggagaagatgtagctagagaaacactggaaaaggttatggcagaagcat 
tgcttgatgctggtgtgaaacgatcagctgttaaagcagtgtgtttgggtctatccggtgtg 
aaccatccaacggatcaggagaaaatattaggctggttgaggagtgcattcccaagtcatgt 
taagttgtatattcagaatgatgccgtggctgctctagcaagtggcacgatgggaaaacttc 
atggctgtgttttaatagctggtacaggaagcatttcttatggatttactgacgatggaaga 
gaagctcgggccgcgggtgcagggcctgttttgggtgattgggggagtggctatgggattgc 
tgctcaagcattgattgcagtgatgagggctcatgatggtcgaggtccacaaacaatgcttt 
cgagttgtattctacagtcactaggtctttcttctccggacgaactaatagggtggacctat 
gcggatccatcttgggctcgcattgcagcacttgttccagtagttgtatcctgtgcagagga 
tggagatcaacttgcagacgagatcttacataatgcagttcaagaattggctataagtgtca 
aagctgttgtccaaagactacgcttggccggggaagatggaaaaggttccttccctgttgtt 
atggttggaggcgtacttggagccaacaataaatggaatatagggaatgaagtcactaattc 
tattttaaagacttatcctggagcttgtgtaattagaccaaaggtagagcctgcagttggag 
ctgctttattggccttaaatttcttgatgaaagaaacagtagctaatggccatagttgacac 
ctgattgtacatagctaactgtgttaactgtataatcattgaagttctctttaatcggtggt 
tccaattctgggagggcatgtccttggatcatggtactgtacttgccttctctttccattgc 
atatgcagactgctaaaaatgatctgttattcaaatgaacgttgcaccaacttgttgtaaca 
tatctttgtttcctaagttgggcagtcttttggtgctggaggagagggaaggagattgtttg 
gtcatagttgcatttgtattgctgatggttatatagaattcataactgatcagtatgttatg 
taatctcttttatagcattctctgttgggataaaaaaaaaaaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcaaaatttagaaccccaaaacaatcagtactcctcactcccaatttggccc 
caatttgaattcaaaatctggaagcattcatgtgactttttcattttttcaaaaactctctc 
tcttctctccctccactctctgtgaaaccctagacacacactccatacgctctcgcaacctc 
tacctctctcttaaatcagcaaacgacagcgatctcatgacggcggtgccgcagtctgccgg 
aagagagctcgcaagcccacccaaggacggcatatccaacctccgattctccaatcacagtg 
atcaccttcttgtttcgtcttgggataagacggttcgtttgtacgatgcaagtgcgaatgcg 
ttgagaggagagttcatgcacggaggtccagttctcgattgttgcttccacgatgattcttc 
tggattcagtgctagtgctgataataccgttagaaggcttgtgttcaactatggaagagagg 
atatcttgggaaggcatgacgcaccagttcgctgcattgaatactcatatgcaaccggacaa 
gtgataactggcagctgggacaaaaccttgaaatgctgggatcccagaggtgcaagtggaca 
ggaacgtactcttgttggaacgtatacacaaccagagcgtgtttactctctttcccttgttg 
ggaaccgtttagtagtagcaactgctggaagacatgtgaatgtctatgacttgcggaacatg 
tctcaacctgaacaacggagggaatcttccttgaaatatcaaactagatgtgtgcgatgtta 
tcccaacggaacaggctatgctctaagttctgttgaaggtcgggttgccatggaattttttg 
atctctctgaggccggtcagtccaagaaatatgcatttaaatgtcaccggaaaactgaagct 
ggaagggacatagtctaccctgtaaatgcaattgcgtttcaccctatctatggtactttcgc 
cactgggggttgtgatggttatgttaatgtctgggatggtaataacaaaaagaggctatatc 
agtaccctaaatatccttcaagcattgcagcattgtcatttagcagagatggtagactcctg 
gctgtagcatcaagttatacatttgaagagggagaaaagccccatgagccagatgccataat 
tgtccgcagcgtaaatgaagttgaagtgaagccaaagccaaaggttttgccgaatcctacct 
catgaaaactatttcagaagctcctcgatcctctcgagtcgactagtttatcttactttgga 
aaacaaaaaaactcttatgtacttaatatttcaatttgacttccaggactcatttctcgtag 
ctggaaattctggagaacagtgataaatttgtaattatccagttagcaattgtacctttttc 
gatgaaaaaaaaaaaaaaag 
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ccacgcgtccgtcgcctacatttgagcatgtcctgccccctcttcgtagctgcacctctcct 

ctgaaactcgagaaaaagtacaacaattttaagcttcagagatgctcgaagtatatagaagc 

agctcagttgagtggaagccgtcgccagtagtagccctagctactagcgccgacgattctca 

ggtcgccgcagctcgagaagacggctctcttgagatttggcttgtttctccttgctccgtcg 

gctggcactgtcaacttataatacatggaaatcctaattctagggtttcttcgttagtgtgg 

tgtaaatcggggtcgagaaggttggatgcaggtcggttattttcgtccagcattgatggatc 

agtttacgagtgggatcttttcgatttgactcagaaggctgtgctagattccattggtgttt 

caatatggcagatggctgtggaaccatgcaatacttcgcagcttcatcaaaatcttccaagg 

cagtatgagaatggccatgttaatcatacaaatggtgttagtagtgataatgagagcactga 

aggtgaagatgatgatgactcggttgttcttcatgaggatgatgatagtgaaaatggtcaaa 

ttgcatttgcttgtgacgatggttgcgttcgaatctacactactgatgagaagaatatgact 

tacaaaagatcattgcctagggtcagtgggcgtatattgagcgtcacttggagttctgatgc 

aaagaggatattttctggtagtagtgatgggtttataagatgttgggatgccaagttagcat 

atgaagtctataggataacagttggacttggaggtttgggtagtgaatctgaactatgcata 

tggtcattgcttgcgttgagatgtggtaccatggttagtgcagatagtactggtagcgttca 

gttctgggacacccatcatggcactcttttgcagtcacattcaagtcataaaggtgacgtga 

atgctttagcagcatcacccagccataggagggtgttctctgctggttctgatggtcaggtt 

ataatttataagctctcaaccagtgaggttgggtctcatgatggagatatttcttctgtaga 

catgaagaaatgggtttatgttggttatgtgagggcccatacacatgatgtgagggccttgg 

cagttgctgtacccattgctcatgaagagcccgtagctgaacataaggcaaagaagcagcgt 

tccggggagaagccccttgattttagttaccataaatgggcacaattgggtgtaccgatgct 

tatctcaggtggtgatgacactaaactttttgcatactctgtaaaggaattcaccaggtttt 

ctccgcatgacatttgtccttcacctcagaggccacctatacaacttgcagtaaatacaatt 

ttcagtcaggcttctttactcttagtccaggctgcgtactggatagatattttttgtgttcg 

tgtaaaaaagggcgttgtgtctgatagctgtggccagtctggcggggctgcgagaacagatc 

tagtggctcgtgttaagtgcaaaacttcgaggaagatcacatgcagtgcaatttctccttca 

ggtgtaatgtttgcttattccgactatgtaaaaccctgtctttttgaacttaagaagagtgg 

tgctagcaagagtccatggactgtcagccgaaggcagctccctctgggactgccatttgccc 

attcaatggttttcagtgcagattcttctcgaatgatgatagcagggcgtgacagaaacatc 

tatgtggttgatgctgtaagcttggaactagttcatgttttcacacctcgtcgtcaagagca 

ttacgaagaattgctaccaaatgaacctcccattaccagaatgttcgctagtgccgatgggc 

agtggttagctgctgtcaactgctttggagatgtgtatatatttaatcttgagacgcagagg 

caacattggtttatatcaagattgaatggttcttctgttacagcgggtggttttactcctcg 

aaatagcaatgtgcttatagtatccacatcttcgaaccaagtatatgcctttgatgttgaag 

ctaagcaactaggagaatggtccaaccggaatacattctccctgccgagaagatttcaagaa 

tttcctggagaagtgattgggctttcttttgctccttctgctaattcatcatgtgtgattgt 

ctacagttcaagggcgatgtgcttgattgactttgggttgccagttggtgatgatgacgata 

ccaacttagctaatggtcaagatttagctttgacgaagctacatagtactcctgcgaatggg 

accttaaagcgcaagccgatagggaatgacttagatatgaaacaaaatggtagaaagaattt 

tgaattctgtgcattcagggatcctgttttgtttgttggacatctttcaagaacttccacct 

tgatcatagacaaaccctggattcaagtggttaaaactcttgatgcactacctgttcacaga 

cgtatttttgggacataaatctttatcacagtttttgttacagctttactaggaaacgttcc 

gaggggtgtattcaacccctttcactcatatattcttctttgttgtttgttgaagttcgggt 

ggggaaaaagttgaaatcaacactcaagttcaatatagcttcacttcatccgcaggagttct 

cctatggaaattgcgtagacctgtaaatatacttatgagctttaactagtgtccattagtct 

gttcagatattgattaatgttttcctgtataacatttattcaag 
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ccacgcgtccgcctctgagtacccttgagaagtcagagatcgatcttattcgctgccgggag 
atctgattacttcacttgttttgttcttttaggaaagatatcggatctgaccgtcaaacaaa 
gtaaaagatgcaggatcaggaagggcatgtggctgatgcaggaaaagaaacattgacatctg 
ttcaaacatctgaaattgaagattggacaaaatacaaggatgatgatattatgcaacagcaa 
tcttccatccaggctgaacaagctgtaaaaactcaatttgttggcgataaggaacctttgtc 
ttcattagaagctgaataccatctgggaaattcaattttgctggagaaaataaaggtgctga 
gtgaacaatatgctgtccttagaagaacacgtggagatggaaattgcttt-ttccgcagtttc 
atgtttggttaccttgagcacattctggaatcacaagatcaaagcgaagttcatcgcattaa 
agctagtattgaggaatgcaaaaagacacttcaaagtttgggctacgcagaattcacatttg 
aagacttttttgcgttattcctcgagcaactcgataatgttcttcaaggtagcaaagattcc 
ataagtcatgaagaactcctacgcagaagtcgtgatccgtccatttctgactatgttgtgat 
gttcttcagatttgtaacatctggtgaaataaggaagcgctcggagtttttcgaaccattta 
tact aggactaacaaatgcctcagtggagcagttttgcaagt cat cagtggaacccatgggc 
gaagagagtgatcatgtgcagattatagccctatcagatgcgttgggtgtaccaatccgtgt 
cgtatatcttgatagaagctcatgtgagaacaacagcatcaatgtaaatcaccacgactttg 
ttcctacaagcgatggcatggggaatagtggtgtttccaagaccacaaatccatctatta.ee 
ttgctgtatcgcccaggacattacgacattctctaccccaagtgatgttcttcatttagggg 
tcgtttggtttgaatacagtttatgtcgggataagttatactggtataagttatgctgggat 
tagttatgctaggattgttttttatccattgtttggtatgttgtattaaatatgacaattgc 
ataatctgtaagaagattgtataccggtgctaattaccccaccctcgataaggtataagtta 
tcccggtgttaattttaatcctgggataacttatacgtggtttgctaaccaaacgaagtatt 
aaggtggcat 

SEQIDN068 

ccacgcgtccggaagaaacgaagccggagaagagggctcttcttttcgtggagaagaacaat 
tataggagtattatcttatacttattcttaccaaagatggatcggtaccaaaaagtggagaa 
gccaagggcaggaacacccattgatgagaatgagattcggattactagtcagggtcgcatgc 
gcagctatatcacctatgctatgaccttgcttcaggaaaaaggatcagatgagattgtgttc 
aaggcaatgggcagggcaatcaacaagacagtgaccattgtggaattgattaagaggaggat 
tgttggtcttcaccaaataacgtctattacatccactgatattactgatacatgggaacccc 
ttgaagaaggccttctacctctcgaaaccaccaggcatgtctcaatgatcacaattaccctc 
tcaaaaaaggagctggatttgacttctgtggggtaccaaccaccattgccagcagaccaggt 
gaaagtgttgacagattttgactatgatggaggatcacctagtggtggacgaagaggccgcg 
gtggtagaggaaggggaaggtctagaggtttctcaggaaatggctttatgttggctgaatac 
gatgatggcgggtttgatcgcaatcggagctatggtaggggtaggggtcgaggcagaggtcg 
tagcttccgtggccgtggaaggggagggtacaatggtcctcaggatgcccagcaagacgctg 
acttctacaatcaagaagcacccatgcagggccgaggccgcggacggggaaggggaactcgt 
ggtaggggacgcggtttcagaactaatgggccgatccatggcggtggtgcttaaagatcaaa 
ctttgaagaatacagagattatgtgctatgagtgcctgctccatgttctatgttttttttcc 
cttcagttgttacccgtgttaacagtaggttattgatctgtaatcagagtagactaattata 
gatttcattaccgcccgtatgtggtgagtttttttgttttttttcttgatatcttctagtat 
tttctttctggtagattaggtgcttgatcaagtgtaatttccttagtgagcagcacattctt 
taatttgtctgtgttagacatgttcagtgttgacctcagtgcgtaaatttgcctctgttttt 
agttggcagaatactcaaattacataatttctgctgcgttttatacttctttaactattgaa 
agtctttgcttttacaaaaaaaaaaaaaaag 
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ccacgcgtccgaacagaagctgatcttgtatatatcgtagacgatgacatgattccagggaa 

gaaaatgttgcaaattttagcacatgtagcagggatagacaagtacaagaattctgttttgg 

gaagcattggtaggattttgccatttagacagaaggattttacttttccgagctataggaag 

tttcgatcaaaggaagcagggctttatttgcctgatcctgctta'taatatcactgttgacag 

aattgttcaggtggatttcctctccagttcttggtttctttcttctgaactagtcaagacac 

ttttcatcgaaacgcccttcactttcatgacaggagaagacttgcacttaagctatcagctt 

cagaagtatagaaatgctggatcatttgtgctgccagttgatccaaaggacaaagaaacttg 

gggtgacagtgagcacagacttgcttatgtatccgaaaccactgttatattcaaggacactg 

ttcaagtccgagacaatcaatggtggaaagcactctccactggttatgtaacacaatgggca 

gcaatgaatcctcagaaaattgatgcacttttctatgcccactctgtcgatgaagttaaagc 

tctcgcgcctcttcttgagaaattcaggtcaactgttggaaagaaggcctacattgttgtct 

caggaggcagcttctgcccgtgcgaagatgctgttacagctttgaactggcctaaggttgta 

tgcaaagaaagaagattcaagattatggatttaggagttggtgctctatcaagtatttcaaa 

ttcagaagtgcccgtcgttcaagcagtctatgctagcatgaaaggactaatcaacattcata 

acccgattcttgtgatcacggtagctgatgcagatcctcatgtgaagaaagcactcaagatg 

gctatagaagctaacaccaacagttcatctttagtccttttacctagatcatcggtcactaa 

gcttctttggatggctgatcttcggtccacagcattgccaaattggaatcgtatgaggcttt 

ccataaatatcatcacacagaatagagctaattcactagcaaggcttctcaaggctctcagc 

gacgcatactatataggcgatgaagttcctattactttcaacatggatagcaaagtggatga 

agcaactataaagcttgttaactcattcaattggcctcacggacctaaaagtcttcgaagaa 

gaatcatccaaggaggtctaattcgagctgttagtgagagttggtacccttcatcggatgat 

gattttggcctattactcgaagatgatatcgaagtctccccttactattacctctggatcaa 

atatgctgtcttggcctaccactatgaccctcaaatatcacttcctgaactctcatcgatct 

ctctttacacgccacggttggtggaagtggtaaaagaaaggcctaaatggaatgcaacagat 

ttcttcaagcaaattcatccaaacacaccttatctccaccaattgccttgtagttggggtgc 

agttttctttcccaagcaatggaaggaattctatgtttacatgaacgtgaggttcactgaag 

atccaaagcaaaatcctgttcagataccaaaatcaagaacaaatggttggcaagcttcttgg 

aaaaagtttttgatagatatgatgtacttaagagggtacgttagcctttatccgaactttcc 

aaatcaaacgagcttttcaacaaatcatatggaaccaggtgcacatattgctgctaaagaga 

atgtggttaagcataacaaggctgattttgaagtgccattgttaaaggaagatttcaagaac 

cttttgccaaatggaaaaatgcctccggtaacaaagttgccttcattgaacctcttcaatca 

gcctgtttctctaaagggattaaaagcagcaggagcaaaactagggaaagatgttattcaat 

gcagtccaacggagatagtagccgttcaccacgacacaggtttaccttcacattgtgcaaga 

ttctgaaaactccatactcgtccgatgatcacaaattaattcttttgttttctctcccaaat 

ttgccatgttacattacttggtggaaatgacagttaggaattggtgggagagaaagatgagg 

gtttgattcagctttatttctcatgcaagtaaggggaataaggattctttatgaatgactac 

tgatgagaatgtactcttgtaatattgcagccaaaattggctttctgtatcatcttcttttg 

cctcattttgcaatcaatgaaagtagacacatca 
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ccacgcgtccgtggtggcaaatcattctcttcttcgcggggttcttcatcttcgtcgagggg 
gtattctacgaggagagcagatcctagtttttcgtattcagttccctattatgcgccttctc 
cttttgggtttggtgggggtggtggggtttatgttggcccagctgttggttttgggtttggg 
tccagtgcctttcttatcatgatgggttttgctgcttttgttttggtttctgggtttctctc 
tgatcggtctgaagggggcagtgtgcttactgctactgacaaaactagtgttctcaagcttc 
aggttgggttgttgggcttgggtagatcactccaaaaggatctcaaccggattgcagaagta 
gcagatacatccacatcagagggtttaagctatgtgttgacagagacaacattagcattgct 
tcgacaccctgattattgcatctcagcttattcatctgttgatgtcaagaggagcatggaag 
aaggggagaatcgattcaatcaactttccattgaggagcgtggtaaatttgatgaagagaca 
cttgtgaatgtgaacaacattaaaaggaaaagttctacgagccagagggcaaatgg&tttag 
caatgaatacatagtggttacaatcttggtagctgctgaaggcgtttataaattgcctacta 
ttaatggaagtggagaattgaaagaagctttgcaaaagattgcatctattccttccagtaga 
acactagcagttgagattttatggaccccacagaacgaaaatgacacgttatcagaacgaga 
actccttgaggattaccctctcttgcggcctctgtaagaaaactgggatttcatgcttttct 
tttactttctaaagatcatataggctgctctcaaccactttttgttatcttcatgtatatag 
ctcgtagagcatcgataatacttgtgtaagaatgagaccaaattttcctaattgtactag.ta 
aaattgttatataaaatgaccagattctccttaaaaaaaaaaaaaaaaaaaaa 

SEQIDN071 

tgtacaaaaagcggctggtaccggtccggaattcccgggat-atcgtcgacccacgcgtccgc 
ccacgcgtgcgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagta 
caaagagga'cgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaacg 
tgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcga 
gttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaagg 
aatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgagc 
tggacaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttgaa 
aggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactca 
acacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtcc 
ctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagca 
acaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttcc 
aagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatctc 
ggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgga 
ttgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctagg 
tgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacaga 
actataatccatgcctcctttaatgtgtaagggggtggttatcaac 



BNSDOCID: <WO 030B51 1 5A2_I_> 



FIGURE 4 (continued) 



WO 03/085115 



39/140 



PCT/EP03/03703 



SEQIDN072 

gtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgtccg 
cccacgcgtccgcaaattcgcggtgatgaagaaaatggttactcacaaagctatcaaacagt 
acaaagaggacgttttgaaccctaataagaaagatttgactaaagaaaagctccccaaaaac 
gtgccttacgtttcgtctgcgcttttcttcaagtacaacacagctctgggaccgccttatcg 
agttctggtcgatactaactttatcaatttctccattcagaataaattggatttggagaaag 
gaatgatggattgtttgtatgccaaatgtactccgtgtataacagactgtgttatggctgag 
ctggagaagctgggtcagaagtaccgtgttgctcttagaattgcaaaagatccccgatttga 
aaggcttccctgcactcacaaaggaacatatgctgatgattgtattgtcgagagagttactc 
aacacaagtgctatattgtcgcaacatgtgatcgagatttgaagcgtagaatacgcaaggtc 
cctggtgtaccaatcatgtacattactcaacataaatactccattgaaaggttgcctgaagc 
aacaatcggtggagctccaagatattgagtacgtgtttcgagcagtcaaacaatggaatttc 
caagaccttggatagtggttcgaattcccatcacggctgtcgctgcatagattaccagatct 
cggtgcgttgtgcaacgaaaaatgctgaagtatcagtcgaatctcaattttgtacccggtgg 
attgttatgtgttcctcaatgataaagaaatatgttcgattttgtttagttagtatctctag 
gtgctgcccccgtgtgtcttaattaaacagccaatagcggtgtcctaaggcattccaaacag 
aactataatccatgcctcctttaatgtgtaagggggtgttatcaaccttgt 

SEQIDN07 3 

ccacgcgtccgcgaggcaacagatgaagcaggtgtgttgttaactatgagcacgttgactga 
agatggcgtgatttcggtgaagaatgcagcttgtgagaggttactgaatcagagggtggaat 
tgaaaatgaagtcgaaaaagttgaatgactgcttgaaccgcttccatgttgctatgccaaaa 
ccacgtgaccagaaagagaggccagcatgcatacctcaggcagtgttggaagccagagctaa 
ggaggctgaggcagatgctgagaaacagaaaaggaaacttgagagagatctggagaatgaga 
acgggggtgcaggtgtttactctgccagcttgaggaagcactatctattagcaaaagaagag 
tggaaggaagatgtaatgccagaaattttagatgggcacaatgtctacgactttattgaccc 
tgatatcttacaaaggcttgaagaattggagagagaagaaggtcttcgtcaggatgaagaag 
gagatgatgattttgagatggacggcgttgagctgacccctgaagaacaagcagcattagct 
gaaattcggaaacagaagagtttgctcattcaacagcatagaattaagaaaagcaccgcaga 
gagccgacccactgtaccaagaaagtttgacaaagacaaggagttcacttcaaaaagaatgg 
gaaggcagttatctgctttggggctggatccaactctagcaatcaatcgagcccgtagtaga 
tcaaggggtcgtaagcgagagagatcagttgaacgtggagatgacattggtaaggatgcaat 
ggatgtcgacaagattactcccaacaagaagcaaagattgaggtcactttccattacggcaa 
gatcaaggtcaaggtcacgacctccagatgaatttgttccaggggagggcttaaaggacaaa 
gcccaaaagaagatggctataaagatggctaagggttcttctaagaagaggaataaggatgc 
tcggcggggagaggctgatagagttattcctactctgaaaccaaaacatctcttctcaggaa 
agcgatcaactgggaaaactgaccggcgctagtaaaccaagatggcattttatcttggaatt 
tgctgatggtacctgtcaagatgcttgtgttgcaatatcttgggtggcggacagaaaggcta 
aaagaaaactcagcttgtgaggaagatgtcaagaattcaatctattgaaatggcaagaccaa 
gactacagattaagtatttaagtttgtgcttaagatgcagctgaacttgctgcctctattat 
gcatttttggaacttagatacctgttgtaagattgtgtttatcccgatgttaaattttgtct 
cagatttttttgattttctttagtacagcctttcctctcttttttgcatcaactttctgttt 
acacgccctaaaaggcgtattcagaaaatgtattcatctgccaatctccttgggatgttttt 

tttttttgggaa 
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ccacgcgt ccga t t gt t aat tact get tctgtccccacaccacttaagagcacct cat teat 
ggcttctcccaactcactcaccactttctgcattatccagtgctcattttgctgtaaactac 
agttatttcttagctgaaaatccaagatttgctgttaattcttgacccttttgccccttctt 
ggattttctgttatttttggattcttttttgtgtcttgaagcaaaggaaggcagaaaatgag 
aggaggggtcagtggaagtttcaaacttgagcttctggttgtatttatactgcttctttgta 
tccgagactccaattgcagctcactgaagcatctaaaaggctctctattcaaggacataaag 
gaggacactcttttgccagagatctccccaaatgctgctccacagccccttcttcccctatt 
tgcaccttctccattggcacctttcacaaacagcactttacccaaattatctggactctgta 
cgcttaactttgatgctgtgagaagtatgatgaccgtgacatcaatagattgtgtagcacca 
tttgcacagtatctggctaatgtcatgtgctgccctcaactggaaacaactcttgttattct 
tattgggcggtctagtaaaaaaacaaatatgcttgcattaaatgggaccctcgcaaagcatt 
gcctttcagattttcagcaacttctggtgagccaaggtgccaatgatactttgcagcatata 
t get ctctccatccgtct aat ct tact caaggttcttgcccggtcaaagatgtt cat gagtt 
tgagacgactgtggactcgtctagcctacttgctgcctgtggcaagatcgatcttgtgaatg 
aatgctgtgagcaaacctgccaaaatgctatatcagaagctgctaaaaaacttgcacttaaa 
gcatatgatcttttaagcatggatggctctcatgtgctggctgatcacacgaccagagttaa 
cgactgtaaaagtattgtacaccgatggttggcaagtaaacttgaccctgctggagcaaaag 
atgttcttagaggactttctaattgcaaaaacaataaagtgtgccctctggcttttcctggc 
atgaaaaatattacaaaggcttgtggagacgggatgaataaccaatcaatatgctgtaatac 
tgttgagaggtatgtctctcacttacaaaggcagagcttcgtcaccaacttgcaagctttgg 
attgtgctgcttcacttggtcttaagctacagaaagccaatgttagcaaaaatgtctacaat 
ctctgtcacattagcctcaaggatttttccgtacaagttgcaccagaagtttcggggtgtct 
tttgcctagtttaccgtcggatgcaatactggaccaaagtacggggatcagttttgtctgcg 
acttaaatgacaatattccggctccttggccatctatgtctcagttaccagcttcgtcatgc 
aataagtctgtgagaattcccgcacttcctgctgcagcatcgggccaaatcagtaaaggatt 
aaatatatggtcacatatgctactgatggcgtcgatgatattgggaatctgctgtatatcta 
atgctgccaatcttgcttattagctgtattttgtggaagcacattttgaccagaaagaaaat 
tcaaaaattacagttctatgaaggtctctgattgacatcaaaacttaaaatgtacagatgca 
ggaaaatcatgcacctgagtgaaaatccaactcagagatgattccaagatcaaattcgcgac 
gaaatttttattccctttctttgggcaataagaaagttgtgaaaaaaattacacagcaggtt 
tagtttcatgtaattatttccacttgacatactttgcctttatgtatttggaattcctcaga 
aaaaaaaaaaaaaaaaaagggcggccgctctagag 
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acagtttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccac 
gcgtccgggatcaagaccctatattgcgttatggtggaatgtatgctttagcattggcttac 
agaggaactgcgaataataaagctatccgtcagttgctgcattttgctgtatcagatgttag 
tgatgatgtccgccggacagcagttttggcacttggatttgttatgtattctgagccagagc 
agatgcctcgtattgtatcgttgttatcagagtcttacaatccacatgttcgatatggtgcg 
gctatggcagtaggcatttcttgtgcaggtactggtctgagtgaggccatctcattgttgga 
gcctttgacatcagatgtggttgattttgtacgtcaaggtgctctcatagcgatggccatgg 
tgatggtccagataagtgaagctagtgattcccgcgttggtgccttcaggcgacaactggag 
aaaattgtcctagataagcatgaagataccatgagtaaaatgggtgcaattttggcctctgg 
tattcttgatgctggtggaagaaacgtgacaatcaagttactttcaaagactaaacatgaca 
aaattacagcagtcgttggactagctgtttttagtcagttttggtattggtatccacttata 
tatttcgttagcttagcattctcaccaacagccttgattggtctcaattatgacctaaaagt 
gccaaagttcgagtttgtatcacacgctaagccctcactatttgagtatcctaagccaacca 
ctgtagccaccacaacttctgctgtgaaacttcccacagctgttttatcaacatcggctagg 
gctaaggcaagggctagcaagaaagaggctgagaaagccattgccgagaaggcagctggaac 
agagtcatcttctggtgcaccaagttctggggagtccatgcaggtggatactccagcggaga 
agaaaaatgaaccagagccatcatttgagatgttgaccaaccctgctagggtggttccagct 
caggagaaatacataaagtttttggaagaaagcagatatgtgccagttaaatcatcaccttc 
tggatttgtgcttctgagagatctacgtcctgatgaacctgaaatattgtccctcactgatg 
caccctcgtGaactgcatccagcactggtggtggatcaactggacaacaggccccggcatca 
gcaatggctgttgatgaggagcctcagccaccaccggcatttgagtacacatcgtgatttat 
ttgtattttaaaagcttcaccaatactttggttttcattccattttggagacgatgttgaat 
ggcagaggtggaaacctatggatcaaatagcacttcctatgatcgagttgaattgtgggata 
cattgaaaagagccccgtggatactgttattctgcctcttgatttccagacttgtgcttgtg 
cttgtcattgtatttcctatgcaagagggactcaaaaactggggactggaaactgccattgc 
gcgttatctttttctgaatctgtcacgtcagctctgtctggactgttagatttttactttat 
gttctaattaagattttatattgttcggatctacaaaaagatttccactgttctccccgagt 
atttatagtcc 
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ccacgcgtccgtaaaaccctgcggcctatcggtttatcttctccctccatttccactgtacc 
atacaatacaatggccaccacttcccttagaacgcccaccactaccgtaaggccgccgtcca 
cccccgtcagcgcctccgcggtgaaacccaattgtatcactttcttatcctacttacaccgc 
agacgggggcagactgcactacttccccgacggtgtcgtattca'ccactccgctaacacaat 
tgttcagctgccgcaccggtcggttgagaaatttatagtctttgcttcaaatggggatgctg 
ctgaggccgctcaaaccgagactcaggaacctgagcaggaggtacaagaatccgagcaggag 
gagaatgtagatggtgctgctgctgaagatgcttcggatgagggtgacaatgcagctgcaga 
tgaaactgcatcattcattgcaacttcattgcagttgtacagagatgctttagcaaataatg 
acgattcaaaagttgcagagatagaaatttccctcaagtccatagaagaagagaaaattgaa 
cttcagagaaaagtagcctcattgaccgaagaactgtcaagtgagagcgaccgggtt cttag 
aatcagcgctgacttcgacaatttccgtaagagaacagagagagaaagactttctcttgtga 
agaatgcacaaggggaagttgtcgagaaacttctatctgttctggacaattttgagagggcg 
aaaatgcaaatcaaggtggcaacagagggagaagagaaaattaataatagttatcagagcat 
ttctaaacaatttggggaaatccttggatctcttggtgttgagactgtggagacagttggga 
agccattcgacccattgctgcacgaagctataatgcgtgaggattcagaggaatttgaagaa 
ggtgttgtattagaagaatatcgcaaaggtttcaaacttggagacagactcttacgtccttc 
aatggtgaaggtgtcggctggcccagggccggcaaagccagagacagcggagcctaaagaag 
agcaaaacgaagtcgaggagaagagtgaggaaggtactgctgaaacagcaggtgatgaaggg 
acaggtgaaggaggtaactaactaccagtgatgatgtgacaagtgagggatgtaacctgtga 
tttctcttttgtacaagcaaagaaaaggacatatttcctggtttgattgaggttgagatagg 
tttttgctggtatacctttcaattttcattaactactgtttatctgaaaggacatcatttta 
ggtcagtcggcttatgactgctgtcttaaacactattttttgaggctttggatagttgagga 
ttcatatagtcgatcccaactagcttgggatcgaggcgcaattgttgtaatactccggaaac 
aagagcgtaatgtcatatgccagactgaca 

SEQIDN077 

ccacgcgtccgaaatatgccagggattatttctcgtccgagatgatgttgatgggcaattac 
tacaataaccttggaatgaatttcaatgtaaataataatggcggcggaggaggagggatgtt 
gttttctgggaatccaagtgcgatgacaaacagtggacggagtagcatcaataattcagtaa 
tgagtcagtctggaggttgttcgagttctttttttatcgattcagtgccggggctcaagcat 
gatactgggctggcggtagagtggacccttgaggaacagtacaaactggatgaaggacttat 
caagttcgcgaatgaacccagtataatgaagtatattaagattgcagccgcgctccgtgaca 
aaactgtacgtgatgttgcattaagatgtaggtggatgacgagaaagcgcagaaaacaggag 
gactatagtttggggaagaaagtgaaagacaggaaggataaatcagcagaagcatccatgaa 
aactggtacatcctcagcttcgccattgagcttcattccatattcactctcctcaaatcatc 
gtaaccatggtgaaaatatcccttctgcagcattacttggaacgagacatctactggaagaa 
aacaatcaggctctcaatcagatttcggccaacctttcaacagtcaagttgcaggacaacat 
tgatctcttcatccgaacgagaaataatataacggcagttttaaacgacatgagaaatatgc 
cagggattatgagccaaatgccaccccttccagttttgttgaatgaggaacttgctagtagt 
gttttgcctagtatgactcagccgatgatgtttggctccacaagtggaatccagctgaaaca 
agagccaggctgctgatgcaaaacgcttggtgttaaatttggattactagcttgtgtaagta 
caccaaattttttgctgtaaatgcataaaaagctggcaggtctttgcagcttgggtatacga 
ctgggttccacgggaagaacatttatgagaacctgttttttggaagctgaacatctgaacac 
aagcaccaggaaatagcagcct cgtgttattgcatatcaggggaaaaactgttatcttgata 
ctgcacttacaagcatttttcttcttcttgtttcagccttctgtgtgtaaatttaggggata 
aatcgatctcaaaatcgatt 
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ccacgcgtccgcggacgcgtgggcgcgaactcagattctcaataatggcggcatcgtacgag 
tacgaagacggaggtttccaacagcaaccggattcagccgggtacgacccgaattttgtgcc 
ggattcggtaaaatcgttcgtggttcatctgtacaggcacattagggagaaaaatgtttacg 
agattcaccagatgtacgagtcctcttttcagactttaagcgagcgtatgttcaaggaaact 
ccatggccttcagttgatgccgttgcgccttacgttgataacgaccacgttttctgcttgct 
ataccgtgaaatgtggttccgtcacttgtacgctaggctttctcctactcttaaacagcgga 
ttgattcttgggataattattgcagcctttttcaggttgtgctgcatggtgtggtgaacatg 
caattgccaaatcagtggttgtgggacatggtagatgagtttgtataccaattccaggcatt 
ctgtcaataccgtgcaaagatgaagaacaaaactgcggaggagattgcattgctgaagcaat 
atgaccaggcttggaatgtctacggtgtcctcaacttcttacaagcccttgtagagaaatct 
acgataatccaaatattggagagggagaaggaaggtcttgaagagtttactgctactgatgg 
gtatgattacagtggtggaagtaatgtcttgaaggttttgggttatttcagcatgataggct 
tgctcagagttcattgtctgttgggtgattatcatactggcctgaagtgcttgcgtccaatt 
gacataactcaacaaggtgtttacaccagtgttattgggagccacataaccacaatttatca 
ctacggctttgct.aatcttatgttgaggaggtatgtagacgctatccaagaatttaacaaaa 
tccttctatatatttataagacaaagcagtatcaccagaagtcaccccagtacgagcagata 
ctgaagaaaaatgagcagatgtatgctctgttggccatatctttgtcactgtgccctcaagt 
gaaacttgttgaagaaactgtcaattctcaattaagggagaagtatggtgagaagatggcga 
gaatgcaaagatatgatgatgaggcttttgccctctatgatgaactcttctcatatgcatgt 
ccaaagttcattactccctctgctccaagttttgaggagcctcttgtaaattacaaccagga 
tgcgtataggctacagttgaagctcttcctttatgaagtgaagcagcaacaattgttagctg 
gtgttaggacctttttgaaagtctattcaacaatctccctggggaagcttgcaaattacatg 
gaagtggatgaacccactttaaggacaattttgatgacatacaagcacaaaacacatgctgt 
cgattccgatgggaagataacttctaatgctgatgtggacttctacattgatgaagacatga 
tccgcgtagtagaatctaaacccgccaagaagtatggagattacttcttgcgtcagattgtg 
aagcttgaagggatcatgactgatattgacaggataaagctggagtaagctatcttcctatg 
ttctagtattagtgctagcttattttgagctttcatttttgtactcgaaagcaagaaggaaa 
atgcataaagtggaaaaagtatacattttgttgttccccctctgagactgtgttaccggaag 
ttgttgataaatgaccagttaaatccatttttttctaaaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgcttttccatcagatttcagctcttttactccacagctgc 
agcaactacattggtggactttctggtaaattcacttggcttctccaacaaagaagccattt 
ctacaagctccaaggtaactcgttcgacactccgaaattatgagccacaattgttacttgat 
ctctttcacaaagtgggtatgaataaaacccagatcaaaaccctcgtttcttcttcccctga 
attgttgttttctcatattgataaaaaccttaaacccaaaattatggttttacaagaaattg 
gcttatctgggtctgaccttgttacatttatcaataaaagcgatttcttgatgagaggttta 
catactactattaaaccaagtcttgattatcttcgggagtatttgggcagttatgatgctgt 
agctagggttattaagaaagagcctaggctgctttccagtaatctGcctaaagtaataccac 
ccaatatactattgttgcaaaatcttgggttttcgctaggggatattgagacggtttttcat 
cggcgtcctaggtatctgcttaataaccctgagtggcttgagagagtagtaaatcaagcaga 
aaagagttttaacgtacctcgggagtcacggatgtttcttcatgccattgaagcacttgtgt 
cgcttgatgaatcaaaattagaaaggaaattagatattttccggagttttggatggtctgat 
tctgatatctgtgcaatggtgcgaaaacttccttactgtttgacttcatcagaggctaagat 
aagaagtacattgaaatttttcatgaccgaacttgggtatgaacctagttatctggcttctc 
atgcaccacttttaaagtacagtatggagaagagggtcaagccaaggaatgaaatcttgaag 
tttcttaaagaaaaccagctgataaaagggaaactaagtctttacactgccgtgtcatctcc 
tgaatcacgatttcgtaagaaatatgttcttcctttcaaggagaagatgcctgagttgtatg 
atttatacatcaaaaatacaagctaaagagaggtcttcacagtgtgacagtggctgcagagc 
agtgcttgtttaagaggtttattcacttcttgataattttgtactttcattttggtgctctt 
ttcaagcatgttgctagtttacctttcattgttgattatacatttatcaaaaaattactgag 
ctatgaaaactagaaattgaggctagtctcattttcaaatcaactgatgtttcttgtttaat 
gggaaggaaagaagtgtagaaaccagacttgatgtatatgccattgattataaaaaaaaaaa 
aaaaaaaaaaaaaaaag 
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ccacgcgtccgcccacgcgtccgaaaaaaagtaaaccggactattctcgaagaaaattttcc 
ggcgactgtgtaaattattttccagtgaagtttccatatttgtatactcattgtcaattgat 
tgcgttgccgtgttcagattctccattaccgatttggtaattaggtttcgagatcggttgtt 
tatcattcttctattcgttaattcgggttaacaggaattttttgatttcgattcgataatcg 
ggttcaagtatttcagaaagagaaacaaaaaggaaaaaggaggtctaaaatttgtttcaaag 
tgaagggggtttaggttgaaagatgttgaccaccacggttggtttgtgatataatatggcac 
gggttgttacagataaagatatgtcgttttacattggtcgcgaggcttcaaagttgtggaag 
agattttgtgcggagataacaacagaaatcaatcttcttgctgagaattggaagtatattct 
tggcggtttgatttgtcagtacatccatggacttgctgctagaggtgtgcattactttcatc 
ggcctggaccaattcttcaggacgtcggcttctatcttcttccggagcttggacaagataga 
gcttacataagtgaaactgtatttaccaccatctttctaacttttgtcttgtggaccttcca 
cccttttattttcaagaccaaaaagatctatacagttctgatatggtgcagggtcctggcat 
tcttagtcggttgtcaattccttcggatcataacattctattctacacagcttcctggtcca 
aattatcactgtcgtgagggttcaaagcttgccacgcttcctcctcctgacaatattttaga 
agtgctattgattgttcctcggggcgtgctttatggttgtggtgatctgatattttcatctc 
atatgatattctctctagtctttgtgcggacataccagaaatatggaacacgaaggtttata 
aaacagtgtgcttggttagctgttattgcacaaagcttattgattcttgcatcgcgcaagca 
ttacactgttgatgtagttgtggcatggtacacagtcaaccttgtagtgttcttcattgata 
aaacgttaccagaactgcctgatcgcactagtgccttgttgcttccagtgaccaaggatagc 
aagtctaaagaagagaatcacaaactgctgaatgggaattctggagatcctgcagaatggag 
gcctcgaaacgggaagatcgtggaagatgggaaagcagtgcacgttgaagcagtaattaatg 
gtgcatagacgacccactaactgatgcttgcaaccttggtacagagattggtaacaatgcca 
ttacaagttgtgttaatataaatcattcctggtgctcttccaagttcaatagttttggtttt 
agcgtaggatacgaaatcaagtcaaggattgaaatgctatggatgtctccacggtcccctgt 
ggttaaatttaatgttatcaaatgtatcatggaattcattttattttttggttaaaagcaat 
tattttcttatttccaaaaaaaaaaaaaaaggg 
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ccacgcgtccgcccacgcgtccggcaacaacagcaacagcaactcttgaaggctattcctca 

gcagagaaacccacttcaaccgcaatttcaaccacagaatcatgctataaggtctcctgtaa 

aaccagcttatgagcctgggatgtgtgcccgtcggctgactcattatttgtatcagcagcaa 

cacagacctgaagacaataacatagagttctggagaaaatttgtcgccgagtattttgctcc 

aaatgccaagaaaaagtggtgcgtctctatgtatggaagtggccggcagaccactggagttt 

ttcctcaggatgtatggcactgtgaaatatgcaaccgcaagccaggccgtggttttgaagcg 

accgctgaagtcttgcccaggcttttcaagataaaatatgaaagtgggaccttggaagagct 

actctatattgatatgcctcgtgaatatcagaattcatctggacaaattgtcctagactatg 

caaaagcaattcagqagagtgtttttgagcaacttcgcgttgtacgtgatggtcagcttcga 

atagtgttttcacagcctgatctaaagatcatctcttgggaattttgtgctcgacgtcatga 

ggagctaatccctagaagattgttgatacctcaggtgagtcaactcggcgctgcagctcaaa 

agtaccaggcagcaacccaaaatggatcatctactgcatctgtttctgagttgcagaataac 

tgcaatatgtttgttgcctcagctcgtcagttggcaaaagctttggaagttccattggtaaa 

tgatctaggttatacaaagagatatgtgcgatgccttcaggtatgcaccttgttctgatgct 

ggaaggtttttattttggcccttttctggaatttggagacattccgcttgtatcaatgtgga 

tatcactacaaattcttgaaatatttgcttctgttagtgcttttaacttcccgaccaggtca 

tggtttgcctgttctgtgcgtgatatctgactcaaccagttctatccaactttcttatcctc 

tccggcccccctctccttttaatctgtcatcttcccgtggaattcaaagcaaagctgaaaat 

gaaggtcgaaattcagatttctagcatgtagcagctcagacaaccaagaggttgtgagttcg 

agtcacccaagagcagggtagggagttattggagggagggagccgagggtctatcggaaaca 

gcctctctacccaccccagggtagggctaaggtctgcacaaactaccctccccagaccccac 

tagtgggattatactaggttgttgttgtaatattggaaattcctactggtaaatctgactca 

tgagttaatgtgtgaagtagacggaatgatgtggcca 
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ccacgcgtccgctgacgcgtgggttccactacatcaagacatctactacactcatctttttt 
gcacttattgggtgtaaatttttgaaacccagttgagaaaaatgagtgtgttacaataccca 
gaagggattgacccagcagatgttcagatatggaacaatgcagcatttgataatggagattc 
tgaagatttgtcttcgctgaaacgttcttggtctcctctgaaacccctttcggttaggccat 
cagattcctttgaatctgatttgtcaagtaaggaaaatcaaactcctttatttgagaattca 
tctgttaatctctcatctccgttacccataaagccacttaaccctaatggggctctggaaaa 
ttcaagactcaagccgaacaagcccaattccaaacagagtctt gat gaga tggcggctagaa 
agagcggaaagggaaatgatttccgtgatgagaagaaaatagacgaggaaattgaagaaatt 
cagatggagattagtaggttgagttcaagattagaggctttgagaattgaaaaggctgagaa 
aactgttgctaagactgttgaaaagcgaggaagggttgtggcagcaaagtttatggagccaa 
aacaaagtgttattaagattgaagagcgtatatcaatgagtgcaagaacaaaggtggagcag 
agaaggggtcttagtttaggaccatctgagatttttactggaacgcggcggcgagggttgag 
tatggggccatcagatattctagcagggacaacaaaggeacggcaattgggaaagcaagaga 
tgattattactcctattcagccaatacaaaacaggcgaaagtcgtgtttttggaagcttcaa 
gaga ttgaagaagagggaaaaagttcaagccttagtcctaaatcaagaaaaact get gcaag 
aacaatggttacaacaaggcaggcagttactacaattgcatcaaagaagaatttgaaaaaag 
atgatggacttttgagttcagttcagccaaagaagttgtttaaagatctcgaaaagtctgct 
gctgctaataagaagccccagaggccggggagggttgtggctagtaggtataatcagagtac 
aattcagtcatcagtagtgagaaagaggtctttacctgaaaatgataaggatgagagtaaga 
gaaatgataagaaacggtcgttatctgtagggaaaacgcgtgtgtctcaaactgagagcaag 
aatttgggtactgaaagtagggtgaaaaagagatgggaaattcctagtgagattgtagttca 
tggaaacacagagagtgagaaatctccactaagcattattgtgaagcctgatttgcttccgc 
gaattaggattgctcggtgtgtgaatgagactcttagggattctggacctgctaaaagaatg 
atagagttgataggcaagaaatcgtttttcagtagtgatgaagataaggagccacctgtctg 
tcaagttttaagttttgcagaggaagatgctgaagaggaataatgtgtaataaagggagctg 
ctaactcttt teat get ctttcaattttcaatcctgccttttaatttttgttcattcgtgcc 
ttttaattgaatggggaagcattcttttgcttcctcaaaetggtattctagcttctgaatta 
cattgtatggtacaatatgaataaggttttgtcttccggcaggttgtccaagttagttttta 
gettaaaatagatgeggea 

SEQIDN083 

ccacgcgtccgctttcacaaagcattgtgtgttctgatgggatggaactatctggcacctcc 
tgataactcgaagtcatatcagatgttatctgctgatgaagcaactgcaaaccgagatgatc 
tagttttgtggccecctttggtaataatccacaacactatcacaggaaaacgtgatgatggc 
cgcatggagggtttgggaaacaaggcaatggatagttaccttagaggtattggatttcacaa 
tggaaaggtgaaggccttgtataacagagaaggtcatctaggtgttactctggttaaatttc 
caagtttaatggatgccatgcggttagcggaatattttgagaaagataaccgcgggagaaaa 
ggttgggctcgactgcagcccgtgactctaggcaaggatgacgagaacaaccctgaccttgt 
caaggttgatcataggactggagagaagaagagagtcttctatggttatctgggaactgtta 
gtgatttagagaaggttgatttcgactctcgaaagaagattaccattgctagccgatcagat 
tatgtgacatctggttagaaccacttgaatagctttacattaagatgtgcttcagttgagaa 
ttttagtcaattccctgptctagatattctggctttgtgttacttttattgcccttaggaat 
tggggcagctttctctgggataactgtggagctaagttataagtgccatgcatgcgtcttec 
cctcttctagtgaatctttctgcctagattagcagttttaaagtccaatggactcgctgatt 
gttcttgtccttgtcccgccttcctcggtttgaggctgggtgtaccatttgggtttcgaaaa 
gttcaggca 
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ccacgcgtccggccgaaaacaatgggaagaggaaagttcaaaggaaagcctactggtcgtcg 
ccagttctccactcccgaggagatgattgctggtacttccgctcgacgtcctcgcacgttta 
ggcaggaagaggctgaacttgaggaagaagagagatctgaggagtctgaagaggaatctgaa 
gaagattctgatggagagaagaagaaaggtactcagggtattattgagattgagaaccccaa- 
tttggtcaagccaaagaacgtgaaagctaaaaatgttgatattgagaaaacaactgagcttt 
cccgacgtgaaagggaagagatagagaaacagcaagctcatgaaaggtacatgaggctgcaa 
gaacaaggaaaaacagagcaagctaggaaagatttagaccgcttggctctcatacgacaaca 
aagagcagaagctgcaaaaaagcgagaggaggagaaagctgccaaagaacagaagaaggtgg 
aagctcgcaagtaacgaatagtaccatgaaatgttgttttcaattctcctagtacaagatat 
atccctaccattattggctaatgatggagtttacacttccacctttcgttcatgtcctgtct 
agtttaaatggagaagagttctctatagaggaaatcatgaaattatactttaagctctgatt 
ctgtacacaaaatagatttgtttggccaatatgatgggaggatttaccagtccttttgttgg 
gttgaaataaggttattgcgactaattaaactatcttgcagtgtgtgtgctatgaggagaaa 
tactttccatggaaaatgtttctaaggaaagtggttttttaaaaacttattttcctgtgttt 
gcttggtgtgtggacgcacgatctttgtatccctgaggtgctttttcaaaagattggaatat 
ataatggtttga'gcaggc 

SEQIDN085 

ccacgcgtccgctgctatctgatattagaagtatattggcaacgacacagatgtggccttta 
tatgttctcgttgagtaagttttcaaggacagaagaagaatactctcaaggatctggaaggg 
atcgtttaagcttggtagagtccttcctttgtaaacgatcagttggcacaaacaaatgccac 
aatacagtcctctcatgttcgtacttttgtgaagttccacatattaaccagttacattcatg 
ggactgtggtcttgcttgtgttttaatggttttgaggactctcggtaaggattatgatatgc 
aagaacttgaagagctttgctgcactacaagtatttggactgttgatctggcatatttgagg 
cagaaattttctgtcaacttttcctactttacagtcacattaggagcaaatccaagtttctg 
cgtggagacattttacaaggagcaattgtctaatgatctggtccgagttgatatgctattcc 
aaaaggcacgtgatactggtattaatatagagtgcagatcgattagcagtgaagagatttct 
tcattgatcttatctgggaaattcattgcgattgctttagttgaccagtacaagttaagtca 
ctcttggctggaagatattggtatatcagacttctgcaatgacaacccaggctatactggtc 
actatgttgtcatctgtggatatgatgctgatacagatgagtttgagattcgtgatcctgcc 
agttcaaggaagcatgtaaaggtctcctcaaggtgtttagaaggggcccgcaaatcatttgg 
aaccgatgaggatcttttactgatccgtttacagaaggaagagactgaaagcagccctttgt 
gatcgtttatttatttgtgtatgaatgattgtttctctctgactttgtccccgctgcgtatt 
gcccatatcgggtattctttagctgtatgtatattatgtacatcaagggctgtagtatcatg 
aatttcgcttccctgtatcatgaattttgtatatgatgcttggagcacc 
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ccacgcgtccgacttgattctctgctctccccttgactttcacacactcaaattcattttca 
tatccactctcaaatccagaaatgcaaatcccccatccccacccctccacccccatctccga 
cgccttcgtccacaagcgcggtaaaaaacgcggtagctacaactgcggccgatgtgggcaac 
ccaaaaaaggccacgtttgccatatctctaacgaccttaataatcatacagatgttcctact 
cccacaccgaccgatgccaaatcatttgttctcccttctccgctatccgttattcgtcctca 
gcaacttccacctccgccacgtcagccacttccccagctccggcgagcgctttctttcgatg 
acgtggatgtcagcgatgatgagtcgcctgtatctgatgatgatgacgtggattgtttggat 
ctggagagtgagttggatttaggtgggtccgggaagttacccgcgagtgctttgtgggaagt 
gcttaagagattgcctccatcagcgctgctttctgcggcgaaggtgtgtaagggttggaggg 
atgtttctagaaggatctggaagtcggctgaggagctaaggcttggagttcctgtgaaagct 
cagattgggcttgttggatcagtgttgcagaaatgccctggacttgttaagctttcacttag 
aatggaaagtgatgtggacgcaacgatgctggcttgcattgcattttcctgccctaatctgg 
attcaatggagatccttacttcagatacctcagttaatcggatcacaggggatgaattaggc 
cgttttgttgctgacagaaggtgccttaccaatctcaagatggaaggctgctcaaatcttgg 
ggcctttac'tctttcttcaaccagcctttccactctttgcctttcggatctcttttgtcact 
ctaagatggtcttcaactgccccaatttaaaggagatttccctggatttttctcgccaagag 
aaggatagcactgatcttactgctatggtagatggtcttggaaggagctgcccaagactaca 
gaacattcatgttgcatctgttcggcttacacatgctgttgtgcttgctctaacagcagcaa 
atttaaggggattacgaatgctttccctagtactagggtcagaaataactgatgcatctgtc 
gctgctattgcatcgagctactcgaggcttgagttacttgatttgagtgggtcaagtattag 
tgacagtggcattgggatgatatgcaatatatttccagagacattgtctaaacttctccttg 
ctctttgtccaaatatcacttcaagtggcattcaatttgctgcagctcagttgcctaatcta 
gagataatggactgtggaatgaccatatgtgatccagatttagacagtccaacaactcagga 
aaatgataacggcgaattacaaagaacaccgattagtaaattacaccttatatatcagaaac 
tgattatcaaacacaaccgcttaaagaaactcagcttgtggggttgctctggcttagatgca 
ttatatctaaattgcccagagcttaatgatttgaacctgaactcctgtacaaacttgaatcc 
agaaagattgctacttcaatgccccaatctggaaagtgtgcatgcatcatgctgtcaagaca 
cattggttgaaactcttcagaatcaggtttgtggtgattttatggctggagacaatcatttt 
ccatccaaacgtcttcctgatggctcaaaggatcagagttcctcatttattcagcccccagc 
catttgatgatgagaagagaaagagaaggatttcaaagcgacggtgcgcggtgcttgtttat 
tagtcaaatacttgtcttgtattggctttgttgtactctagaccaattgtccattatttgtt 
atatagtgatctgaggctaaggcctgatcatgtaattttcattgattaaactatactcaacg 
tcaatacagggattgtatttcctctatcaataaaaagtacagcagcc 
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ccacgcgtccgtctcaatccaaacttcgagttcacatttgccctagctttgagaaatgatca 
tttgcaaactcaaattattgaattagatcataatagaatccgtaatatacccaaattgtatt 
gttttgttttgatgtgtgttgtgaaaatactgtaaattttgaacaattcgattatggccgat 
ggtaaggtggttaagcgtgtcaagtacaaatcttcagtcaaggaccctggcgtttctggcgt 
tttgaagttgaccaaggaacggtttttctttatgccgaatgacccaacatcaacgacaaagc 
ttaatgtggagttcaagttgattaaaggccacaggtcttctaaagagggttcaagtaagcag 
gctcttcttaatctcatgcacgatcagggcaggaattatatttttgagtttgatagcttccc 
ggaccgcgacaagtgtcgagaatttgttgcctctgcaattgcggtttgtggagaagttgtga 
aagctgcttctgaaaaacctgctgttccacatgatgaacaactcagtgcagcagaaatggga 
cgtcggattaagt tact gcaggagaatagtgaattgcagaa act ccacaggcaattggt cat 
tggaggtattctatcagaggctgaattttgggccgctaggaagaagctactggaacagggcg 
atatcaagaagccaaaacaacgggtggctttaaaaaacgacatgtggagtgtaaaaccttta 
tccgatggccagacgaacagagttacatttaacttgacaccggaggttattcatcagatttt 
tgctgagaaaccagctgtccgccaagcatatttgaaatttgttccgggcaagatgtcagaaa 
aagaattctggactaaatattcaagagctgaatacctccacagcacaaaaaatattgttgca 
gcagctgctgaggctgctgaagatgaggagcttgcggttttcttgaagcaagatgacatgtt 
agcatttgaagctcgtaagaagatcagaagggtggatccaactctggacatggaagcagatg 
aaggtgatgattacatgcatctcccggatcatgggctacctcttgatgaaactaaggagatt 
ctggaaccacagtatgaaccattcaagaggtcgttctcgcagtacctcaaccagcatgcagc 
agtagttcttcgaggaagagttatagatgttgagctgggtgacacaagatctgttgctgaag 
cattcatcaggacaaatcaggctgaactagctgccgaagtgtctgatgagagtgcatataga 
gaacgcatagctaaagtttctcgagttgctgaaattgaggatcttcagggacctcatgagcc 
accagttgcattgctaagtatcaaggatcctcgggattactttgattctcagcaagcaaatg 
caataaaggctttgggggatgctggtacagggacaagacagctgaaatttagtgtgagcaaa 
gaagaagccttttgctccttgaagaactccatcttcgagataaattcacaaggattgatcga 
accaataattagtccagaagtagctctcaaggttctcaacgggcttagtcagaatatctcga 
gtacaaagtatcatctgggaaagaacccccatgagagtgttttagataggctgcctagtgca 
acgaaagatgaactattactccattggacatcaattcaggaattattgaagcacttctggtc 
atcttatccaataacggcaaaatatttctacaccaaggtgactagattaaaggatgcaatgt 
ctcagatataccccaagttgcaggagatcaaggaatctgtgcaatcggatttcagacatcaa 
gtttcccttcttgtacagccaatgcttcaggctttagatgctgcctttgcccattatgatgc 
agatatacagaagagatctgccaaaagtggggagagaccaaatggatttgcttaggcaaaat 
ttttctccattttcatccgatatttaagctctttgttttctgggggttatatacacgaatgt 
acatttaacaaaattttgttcgagtgtgttatagcatattctatatcttgacagttctaatt 
gactgcctgcggtaattgtacatctagtggaataatggttg 
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ccggtccggaattcccgggatatggtggacccacgcgtccgctctttggatgttatggaagg 
atcaaagtggaatatgactcgaagtggcttttgtggaatgcggtctaaaaagtttgatggct 
tcatcgatttggatggatatgacacgatagcgcttaaacttaaaggagatggaagatgttat 
atttctactatatacacagagaattgggtcaatactcctggaca'agatgaagataattcatg 
gcaagcattcgtttttgtgccaaaagaaaactggtatattgcaaaaatcccgcttactcgtt 
atgtacctacttggagagggaacatgataaatgcaaagttggagatgaatccagctcgaatt 
cttggcatgtctctatctgtcaacgcagaaggtggagttccaggtgccaagtctgggcctgg 
tgatttccaagtggaagttgattggattaaagccttgcggatgcagtaagcaaaaggggaac 
atcttaaagaattattagaataggctgggacatttggggcatccacgctcaccagttgagca 
agattgtggaaatgccattcagagatggagaagatacaggttctttttctatgtaccttgga 
ggaaaagagagaattgagctgaggaaaggagtgaaaccttaaaatgcagtgactacaggcca 
caccaccaagtcaaattatcagatttttttcttgtaataaatggggctcttcaatttttctt 
taggctatcaactagtatggtaaactaagcagtatgtttaataattatatcctcgtctgtta 
caaggtttggcaatcaaataatacaacaatgtgcttggaatcggtagtactgttaaaagatt 
taatgtcaatgtgcaatgcgc 

SEQIDN089 

ccacgcgtccgacctcttcacctttacaaacttctacaaatatatttacttcaaacaactga 
gtagtcctattgtttctgattcgatgtcggttagaacagtgaaagtgagcaatgtctctctt 
ggtgcgtcggagcaagatatcaaggagttcttctcattctctggggat'attgagtatgttga 
gatgataagtgagaatgagcgatctcaaattgcatatgtcacattcaaggatccccagggtg 
cagaaactgcagttcttctttctggagccacaattgttgatcagtctgtcatagtagccctg 
gaacctgactacgagctgcctcctacagctccagtgccaatcaaggcaactgagagggctaa 
tgcagctggtggtggatctgctattcaaaaggcagaagatgttgtgagcagcatgttggcaa 
agggcttcatcttgggcaaggatgcagttaacaaagcaaagacatttgatgagaaacaccag 
ttcatatccactgcatcagccaaagttgcttcactagatcaaaaaattggacttagtgagaa 
aatcaatatgggaacaactattgtgaatgacaaagtgaaagaaatggaccagaagttccaag 
ttactgaaaagacaaaatcagcttttgcagctgctgagcagacagttagcactgctggatca 
gccatcatgaagaacagatatgttttgacaggggcatcttgggttactggtgctttcaataa 
ggtcaccaaggctgcaggggaagtgggccagaagacgaaggaaaagatggcagaagaagaac 
agggaagaagttcagctgcaggttacgtgcctatacatgctttctcggagtccccaaaagct 
tccaaaaccgaggaacctgccaagccctcttcacctaagggcctaattctctagcttgtgca 
aaaatatttcaaaactattgttcaattccgcttgtctgatcttttagctgtcattgtgttgt 
ggttagacttagatatgctagttatacataaaatgtcctgtacgattgttgatacatggaac 
gatagttgctggactattaaattccctgtcggagtgctgtgcgg 
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ccacgcgtccggagagtaaaagtggatcctatgaggaagagcgtcagtttgaacaatctctc 
acagtacgaacagccaaatgctaacaacagcgctgatacatctaaagtggctgaggaaggat 
atgcctctgcagatgacgctgttcaacaccactccaacagcggtcgcgagcgtaagcgagga 
gtaccatggacggaggaagagca.caagttattcctattaggattgcagaaagtggggaaagg 
agactggagaggaatctctagaaacttcgtaaagacacgtacaccgacacaggttgcaagtc 
atgctcagaagtacttcctccgacgaagcaacctcaaccgtcgtcgccgccgatctagcctc 
tttgatatcaccactgactcggtatcagctatgccaatagaagagggaaaaaataagcaaga 
aatcccagttccaccagttgtagcatcatcaccaacattgcctactactatagaggctacca 
aaaccaatgcatttccagtggcacctatcatgttaccagcacagattgatcagtcaagagaa 
agtccaactctgttgcaacgaaatcaagtgaattcgtatacgccagttcgccctcttcctat 
gctttcaatgcccaatccatcaacagtatttgaccttaacgtgaaccagatctcagaagtcg 
aaccattgtcactgagattatccttgtcacttgatcagggacaagcatcatctactagacac 
cactcggcatttaaagtaatgccaagcttcagtaatggagagagcatcattagtgtggcatg 
agatcgaaggatctgtgagaaaaaaatgaaagcaatatggaaagtaaaaataggacaagagt 
gggtacgctgcactcataattatattaagggaatgtttatttaaggagagattaattgacta 
gacatttggtcctgatttgtacagaccagaaatatgtcatgccttgtggttacctgtttaat 

gcaacgagtatactgac 
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ccacgcgtccgactttttccactgagctccactccaatgtgtaaaaccctagctaaaaatct 
ttaaagttagggtttcaaatttgcaatggggaactgctgcagatctccggcagctgtcgcaa 
gagaagacgtgaagtcttcaaacttctccggcaacgatcacggccggaaagacaagtccagc 
gccggaaaatcgcaaaaaccggtaaccgtgttaaccgatgtgaaaaattcgaacgttgaaga 
gaagtatttagttgatagagaactaggcaggggcgaattcggaattacatacctttgtatag 
atcgtaacagtaaagagcttttagcttgcaagtcaatttcaaaacggaagctacgaacagct 
gtagatgtggaagacgtgaggagagaagtagccataatgaagcatttgccggtgaattcaag 
tattgtgagctttagagaagcttgtgaggatgaaaatgcggtgcatttggttatggaattgt 
gcgaaggtggggaattgttcgataggattgtggcgcgaggacattatactgaacgagctgct 
gctgctgttacacggacgattgtggaggttgtgatgctttgtcataagcatggtgtgattca 
tcgagatttgaaacctgagaactttttgtatgctaataagaaggaaaattcgcctcttaaag 
ctattgattttggcttgtcaattttcttcaagccaggtgagaggttctctgaaatagtcgga 
agtccctattatatggctcctgaggtgctcaaacgaaactatggaccagaaatagatatatg 
gagtgcaggagtcattttatatattttgttatgtggggttcctcctttttgggccgaatctg 
aacaaggtgttgctcaggccatcttacgtggggtgatagatttcaaacgggaaccctggcca 
agtatttcagagagtgctaaaaatcttgtacggcaaatgctggaaccagatccaaagcttcg 
actgactgcaaaacaagtacttgaacactcttggcttcaaaatgctaagaaggctccaaatg 
ttccccttggagatgttgtgaagtcaagacttaagcaattttctttgatgaataggtttaag 
aggaaagctctgagggtgattgctgatttcttgtctaatgaagaagttgaagacctcagaga 
aatgtttagcaagatagacaccgataatgatggaattgtttcagtccaagaactaaaagctg 
gacttccaaagctcaactcacagctggcagaatctgaagtacaaatgcttgttgaagccatt 
gataccaatggcaaagggaccctggactatggagaatttattgctgtttcactccatcttca 
aaggatggctaacgatgaacatctgcacaaggctttctcctactttgataaggatggaaacg 
gttacattgaaccagatgagcttcgagatgccttgatggaggatggagcagaaaactgcgcc 
aatgtggcgaatgacattttccaggaggttgatacagacaaggatgggcgcatcagctttga 
agaatttgcggccatgatgaaaactgggacagattggagaaaggcttcacgacattattcaa 
gagggagatttaatagtctaagtgtgaagctaatgaaggatggatcgcttaacttgggaaat 
gagtaaggtttacattttttcatcaaaatgaagtattgtatcgatgtgtatttgatctcgat 
gtgtatttgatctctcgccattgttttctggggtgcccattagattgtttgcttgccaggat 
ggaaaaggggcgacttcatctgggtaaccgttgtaaccatttgaaacacagaatgtatcctt 
ctactccc 
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ccacgcgtccgcttggacttggttacaaaaacagtagccttaaatagtccagctactgccct 

tgccttgtacatgcaatgcaagtacaggctgccgccatggcgaatttctcaccgttgctatt 

aacgacgatttggctagtgcttgtaatttgtaaaggagtagagagtggtcattcctcagctg 

ttggagatccaggaatgataacagatggcttaaggatagctttagaagcttggaacttttgt 

aatgaagttggtgaagaagctcctggaatgggtagccctagagctgctgattgctttcatct 

ttctgacagttctctgactcacaaggtaaccgagtcggataataagctaggagttggcaaga 

cattccctggcctgagtcctaaggctaagaataatccggacttatatgctgttgaaaaggaa 

ctctatcttggttcattgtgtgaagttgatgacacgccgaggccatggcaattttggatgat 

aatgttgaagaacggaaattatgacacaaaatctggtctttgcccagaaaatgggaaaaaag 

tgcccccttttaatcctggaagatttccttgttttgggaaaggatgtatgaatcaacctatc 

ttgtatcaccagcccacttcattattagccgatgatattatgcggggaggttttaatggtac 

ctatgatttgggttcttcaacgggtggcagtagttccttctttgaggtgctctgggaaaaga 

aagttggcacagggggttgggtatttcagcacaaactcagaacctccaaattgtatccatgg 

ctgatgttgtatcttagggcggacgcgaccaaagggttctctggaggctaccactacgatac 

cagaggaatgttaaaaactctcccggagtcacctaattttaaggtcaaattgaccttggatg 

tgaagcgagggggaggaccgaagagccagttttacttgatagatattggcagctgctggaag 

aacaatggtgctccatgtgatggagatgtgctcactgatattaccagatacagcgagatgat 

cattaatccagaaactccagcttggtgcagccccacaaatattggcaactgcccaccttttc 

acatcacaccgaacaatactaaaatctacaggaatgacacctctcacttcccttactcagct 

tatcactattattgtgctcctgggaacgccgagcacttggaaaagccatatagtacatgtga 

tccttacagtaatccccaagcacaggagctagttcagttgctgcctcatccaatatgggcag 

actacggctatccaaccaaacaaggagacggctgggttggggatggaagaacatgggagctt 

gacgttggtgccctttccagcagactttacttctatcaggatccaggtacacctcctgctag 

aagaatatggacatctctggatgtggggactgaaatttttgttagcaacaaagatgaagtgg 

cagaatggactctgagcgactttgatgttttaatcacctcgtaaagccataataatgatacc 

cttctatttaacattgtaactgtagccaaagcaaaatcagatagtgggacaaggtctcatca 

ttcttgatgtctaaactttatctttctatactagatctgatctgacggggcaagtcctggca 

gctttatttccgagagaagaaaaaagaattttgtttttgctttaaaaaaaaaaaaaaaaaag 

ggcggccgctctagagtat 
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ccacgcgtccggcttcactatcttgagctggccatattttcatgcttgcaggaactgattta 

ctgctttcgctgggtggcgtttttcaagtttaaagtttaggtcaaactcagaggattcgtca 

tagtttacatacttttctgctggtcattaacgaaactatgtgtactgaaacacgggggtgat 

tcaattcttctttgtcacagatattaaagctggactaaagcatctacagactggactctatc 

ctgtagttagatatgcagactaagaaaaaattaaatggaagaaatccccgagagctggctag 

tccaaaggtttcaagacagcagcggaagatgtccgagaatgtgcaaactcaggcaaagcaag 

ttaaggaacttataacatctacagtgaggaagcaaaaatcaggaagcaatttcttgaaaaag 

attgagaattatgttgctgctacagatctggatgtaagatttggattggtgtctgatgacac 

ttctgctgcttcagacgcacatgatgctgttcatgaatataatactattactattaataagg 

actataatgttgaaactgatagttgcacaaatgatactatattttctcctaccttccatata 

tccagaactattggaggggaaatttctaacagagcagacatacccaaattcattgagcaagc 

agaccagccattgcaggagcctggaaaggaaaatatggaagttgatctgctgacaagtcatt 

ttgtgctggacgaggctaccgatatagggggccagcatatctcctctgaagtttcagctgtg 

catctctctattaaagattcaaaactggaatgcattgatgaatttaatcaatttcagttgcc 

tgctgatgttagtatggaggaagaggaaactgaagagtttgatgactttgatccatattttt 

tcataaagaatttaccagacttgtactcagttgttccaacatttcggcctgtgctattgcct 

aaacaaacacggagttgcccatcaaccactcttgttttggacttggatgagaccttggtgca 

ctctacacttgaaccttgtgatgatgcagatttcactttctcggtgaatttcaacctgaaag* 

atcataatgtatatgttcgatgccgtcctcatcttcgggattttatggatagagtatccagc 

ctatttgagattatcatatttactgcaagccaaagcatttatgctgagaagcttctgaatgt 

gcttgatccaaagagaaaagtatttaggcatcgtgtttaccgtgagtcatgtgtatttgttg 

atggcaattaccttaaagatctgtcagttcttggccgtgatttagcacatgtgattatcatc 

gacaactctccgcaggcatttggattccaggtggacaatggtattccaattgagagctggtt 

tgatgaccgctctgacaaagagttgctctctttgctcccatttctggaaagcttagttggag 

ttgaagacgttcgaccgattattgctagcaaattcaaccttcgcgagagaatagctgctgct 

gctacttgtccttttaactctattagaggtgatgcatttgagagatagggatccgtgtcttt 

atagattcagtcttggttacttgaattttagatttcaatggctctcgatgagttgcaggaat 

cagttctaatgtacctttgcggatgtgagtttgctagaggctgatctctaatgttggttaat 

ttatgtaa-ttcacatttatgtaatggtgccataacgacgcttgagattggaggaaacttctc 

aataaggctgtatctgaaacgtgaaatcatccaagcgag 
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ccacgcgtccgcttagggttccaaattgctctaaattcccgcggattgagagttcattggag 
acttccattgttcccagcggctaagatgagccggttgattgagcatcacctagcaaataata 
aacaggacatgaaagggacagaggtttttgttggtggtttggcccgtactactactgaaagc 
aaaattcatgaggtattttcttcatgtggtgagattgtggaaatacggttgataaaagacca 
gacaggcgttcctaaggggttttgctttgtacgatttgcaacaaaatatgctgctgacaaag 
ctctgaaggaaaaatctggatatgtgctggatgggaagaaactcggggttcgcccctcagtt 
gagcaggacactttatttcttggaaatcttaacaaaggttggggtgcggaggaatttgagag 
tattgtgcgccaggtttttccagatgttgtatctgttgatcttgcacttcttggagatgtcc 
aacctggtcagaagcaacggaatcggggttttgctttcgtgaaattcccatctcatgctgct 
gcggctcgtgcttttcgggtaggctcccaatctgattttctcattgatggcaagttacatcc 
atctgtacagtgggctgaggaacctgatcccaatgaacttgctcagatcaaagcagccttcg 
ttagaaatgtacctcctggtgctgatgaagattacttgaagaagctctttcagccctttggc 
aatgtagagaggatagctctatccaggaaaggtagctccaccattggattcgtttacttcga 
taagcgatctgatcttgacaatgctattatggcgttgaatgagaaaactgtacaagggccaa 
tgggaggtccctcatgcaagcttcaggtcgaagttgctaggccaatggacaagaacaggaaa 
cgaggtcgtgaggatccaaacatgtccagtaccattgagagtcattccaagcttttgaagga 
tgatccagatgttgagatgattagggctcctaaatcaactgctcaactggagatggattatt 
cggatccttatgaagctgctgtagttgcattacctgtggttgtcaaggagcgtttagttcgg 
atcttgcggcttggtattgctactagatatgatatagatgttgaaagtttaaccagtcttaa 
gatattgccccagtcagctgccatatctattcttgaccagttcatgttgtctggagctgata 
tgcagaacaagggaggatatctagcttcattaatttctaagcaggttgaaaaactgggaccg 
aaacaattcgatagtaggtcaaggatagaagatgttggcttgagggtgccagaaccagacag 
gttctctacaagagttcgtttgccagatctagattcatatgcctcacgagtacccttgccca 
tgcctaggactgatgtttacacatctcactattcagcgtatttagatccccatctgtctggt 
cggatgacagcaaagaggatggaggaagcaagttcccatttgcaggcgac-ttcacttctgtc 
tagtcgggtggcaacgaggatggaggaggcaggttccactttgcagtcgctcctatctggtg 
gggtgacgacaagaaggatggaggaagcaagtccgattttgcaggcaacactccttccatct 
ggtcgggtatcaaggatggatgaagcaagtcccaatttgcaggcaacatggagcccttctcc 
tactaatgacagaattggacttcattcacacattaccgcaactgctgatcatcaacatactc 
gaccacggatcaggtttgatcccttcactggtgagccatacaaatttgaccccttcactggc 
gagccaattgttcccaagagctcaagtcatcatcgaagcctgtactgaacgttctgagcatt 
ctaatttacaaatggcttattgccaaacctatgtaacataatgatgcgtatttttgttcatc 
cgcagctgtaaaatagtagctgttagcaggattatttggttatgtttctcattgacttcatt 
gattgcgaaggtgcatttggaatctcggcaatcacaatttatagccggtgca 
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ccacgcgtccgcccacgcgtccgcaaaccctcccgcgaagctaaatttcccctttttctctc 

tctctttgattcgaataagagaaattgggggtttacagtaattggggttttcgtatatttag 

ttctgtaaactcatccctcgactcgattcttcttttgatttgcattgattatcattagattt 

gactttgattttcaattcaattctaattgatggaggatactaatcagcagaacgtcgatcga 

ttcacgtctgtttcatcttcaggtgaaagagccgttgagccacataatgctgcagaacagcc 

tatttcgcccaaagatgaaaggactgtttctgcaaatgcttctgtgaatgcaatcatccctg 

gggctttaagaaatgctaaagatcaccctgttacctctgaaactggagctctgtccgccttt 

tatcctctcaattcctattctcctcaggaccaaggtttttactatggaggttacgacaacgg 

cactgggagttgggccgaacaatccaatgatgtcaatgtgaacttgcatgtagttccgccag 

caatgtacaatgagaatcccctctttttccctccgggttacggctttgatgctcagatggca 

tttggacagttctcccccatcgctagtcctctttctccttttatgatagatggccagctata 

ctccccacaccagattccggtttctccaaattactatgcaccacctatttcccctggcttgc 

cgcatgttacatcagctcttccagcttcgcagcctgatctggtggcaccaggaagcactggc 

catgaaattgatagcatgtattttgggccaggatcaggttactacatacccgttggatcgtt 

tggcggaggcgagctttcgggaagcagcaacattggtttctacaattaccaaggtgaatttg 

gatctggtcaatctttacctaatcgacctaaccccctggactctggaagatacatgtctcaa 

atgacatctgcggcactatatccacaaccagttggcatacttgggtcgtacgaactaaacgc 

catgcaggcttcacatcaaggtcttggattcacaccaggctcctcaggcaggaattattccc 

aaggcaatccttatcctagtgcaaactatggtactgggtctagttctctgtgggaaccaggt 

cacagaaattggctaactcctgacagaggtggaagacgtgagagggatcggcactctgttaa 

catttctactgaatcacttggtatggcaagtgaacgaaaccgaggaccaagggcattaaaac 

caaagagcaagggtcccattgaggatagctcttcatctgtcatccgtaaagaagttgagtcg 

actaatactttgcagcctgagcagtataatcggcctgaatttgtcactggttatgaacatgc 

caagttctttgtcatcaaatccttcagtgaagataatgttcacaaaagcatcaaatatagtg 

tgtgggctagcactcctctgggaaatggaaagcttgatgctgcttatcgtgaagcaaaagag 

aggaatgctgattgtcctgtttttctctttttctcggtgaatgctagtggacaattttgtgg 

ggttgctgagatggttgggcctgttgattttgagaacaatgcggagcactggcagcaggatc 

gatggagtgggcaatttcctgttaaatggcatgtcattaaggacgtgcctaacagtcagttc 

cgccacctacttctggaacataatgacaacaaaccagttactcacagtcgagattctcaaga 

ggtgaaattgtcagagggactagaaatgttgaaaattttcaaaaactatgaagcggatacct 

ctatattggatgatttcacctattacgatgagagggagaagtccttgctggaaaagaagagt 

aaacagcgaacacttcaacctggtagtgctgcagttactactgcagctgacacaataagtca 

actagcggatagtcttgccggcacattaaacttggaaggcaacaagaaattgccttaaaaag 

agtttgtaatgcttaagcctgtagcagattccagaggcaatatcaactgctgtcattcaatg 

ttagttggttgggcaaccagctggctcgtatagttaaggagattgctaacagcattttggga 

ggtggccttttcatcacctttggatcagaaaactctctttatctctttcctttatttgtcat 

ttgttcaatgagttggttgtgatttgcaattcttgggcaaggagaggcagccagtaatgtta 

tgagctatcgtttaccc 
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ccacgcgtccggggttggcaagaaaaattctaaagagaaaaggaggaacacattgattttac 

gaggactagtcattcatttttcttggacagtcttggaaactaacagcttgattgctgaccct 

ttctcctatcacggctagacagccaaactccacattcttatataaagaccacccttttcatt 

ttggatttggtaaaacaaaggaagtccagaagataatcagagaaagatgaaatttgggaaag 

aatttgcatcccaaattgtccaagaatggcaagaagcctatgtggattacaattatctcaag 

agtgttttaaaagacatcttgaatttcaatattgccgcttcacctgaagttgaaggctcctt 

aaagagaaggctatctatgtacagagcctttagtggattacaaactagtttcaaagtttctc 

aaaacaatgaagatgaagccatattagtgagttcagaaggccactatcaaactatgtttctt 

atgtcatctgaaaaaggtggagaaaatgagatggttttctttaaaagacttgatgatgaatt 

caacaaggtgataactttttaccagaaaaaagtagaggaagtgaaggctgaggctgatgagt 

tgagtaaacaaatggatgcacttattgctctaagaatcaaggttgataagccttctataaga 

atcaaaaattcccatttgggaaatccaggtaggtcagaaatggaggcaatacaagaagcaga 

gatgacaagtgaagaagaagaagcaacaagagggaaaagagatacagcaaatacaaaacata 

tggaatttaggcctgctccactagagattttggaccatataaaaatcaatcttgaacccgaa 

acacctgtctcaactttaaaaaatatcatccatacttcaaaatccaacttatcattcagcaa 

agaggagctcagaaaagctgaagaacaaatgagaaaggcttttgttgagttctatcaaaagc 

ttcgacttctgaaaaacttctgtctcttaaatgtgttggcattttccaagatcatgaagaag 

tatgacaagatcacctcaaggaaagcttctaaatcatacttagagatggttaataaatctta 

tcttggtagctctgatgaggttgctaagctcatagaaagagtggaggccacattcataaagc 

attttgtcaatggaaatcgaaggaaaggaatgaaatctttaagaccacaagctaaaagagaa 

acgcatagagtaacatttttcctgggtttgttctctggcggctcaatagcattagtggcagc 

tattgctgtatccatacatttcggaaaccttctacagcatgagggtcgtgggcagtatatgg 

aaaatatatttccactctacagcctattcggatacattgtcctccatatgctcatgtacgcc 

gggaacatatactactggaggcattttagagtcaattatcccttcatttttggcttcaagca 

gggaacagaactaggttacagacaagttcttttccttgcttctggtctttcagtacttgcat 

tggctgctgcattgtcccacctagatatggagatggatccaaatacaagaagttttgagaca 

gtgattgagctgatcccacttgccgtggtgtttattctgcttctaataactttttgccctct 

gaacatcatatatcgttcaagtcgcttcttccttataagatgtggttggcactgtctatgtg 

ctcccctttacaaggttaatctaccagatttttttcttggcagatcagcttactagccaggt 

tcaggcaattaggagtttgcagttctatgtctgctactatgtgtggggcaacttcagaacaa 

gatctaataaatgtcaagaaagcagtgtttatcaaatcttatacatagtcgtcgcaattatt 

cccttttggtctcggtttattcagtgccttcgccgcttatttgaagagaaagattcaatgca 

ggggcttaatagcctcaaatatttctcaaccattgttgctcttgtgatgaggacactttatg 

ctcaaaagagaggaacgttttggagagtaatggcggcatcatcctcaggaattactacagtt 

gcaaatacttactgggacattgttatagattggggtttattgcaaaagaattcaagaaaccg 

ttggttgagagacaaactgcttgtgccacacaagattgtctactttgttgccattgttcttg 

acattattctgagactagtatggatgcagttggttcttgattttcaagaactaccatttctg 

cacaagaaagcaatggttgcagtagttgcctctctagagatccttcgccgaggcatgtggaa 

ttttttcaggttggaaaatgagcacttgaataacgtcgggaaatatcgtgccttcaagtccg 

taccattgccttttaactacgaggaggacaagagt-ctatatctatacctctagctgatacgc 

agaagtcgaaggaatccagggttttcttttctttctttttttttcttgcacaaattcttctg 

attcgttgccgtatattggt 
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ccacgcgtccgaaaccataaacagagcagagagcgattgagagagagagagagaaatggaga 
ctgtaaaaaagagtgcatcggcaatggaagcattcgagaagcttgagaaggtaggggaaggt 
acttacggaaaggtgtacagagcgagagatagggttactggcaaaatcgtagcactgaagaa 
gacgaggcttcacgaggacgaagaaggtgttcctcccactactctccgcgagatctctcttc 
tgcggatgctctctagggatcctcacattgtcaaactgatggatgttaaacaaggccagaac 
aaagaaggaaagacggttctctacttggtctttgagtacatggatactgatgtcaagaaatt 
tattcgtagtttccgcgcaaatggagaaaacattccccctaaaactgtcaagagcttgatgt 
accaactatgcaaaggagttgctttctgccatggtcatggcgtgttacacagggatctgaaa 
ccacacaatcttctgatggaccgtaagacgaatgtgctcaaattagcagattttggacttgg 
cagagcttatactctgcccatcaagaagtacacgcatgagatattaaccctatggtatagag 
cccctgaggttcttcttggagctactcattactccacagcagttgacatgtggtctgttggt 
tgtatctttgctgaactggtcacaaaacaagccctcttcccaggagactctgagctgcaaca 
actgcttcacattttcagattgctaggtactcctaatgaagaactctggcccggggtgagca 
agctagtaaactggcatgaatacccccaatggaacccccagccactctcaactgctgtccct 
ggtctagatgaagatgggctccaccttctaactgagatgttgcattatgagccagctaagag 
gatttcagcaaagaaagctatggaacatccctatttcgatgatttggacaaaactcctctct 
gaagtcccgctcatgacccatctgttgaaaaattgcaaatttctcatcaccggagatcaaca 
aacccatctaacccctcatcgcaagcttttattgcttttctcaagcatcttttaatagtatc 
aattagtatgactagcttcacctaaaaactttgtctttctatatcaattggatcagtgtagc 
acaattatgtggaatgatagaaccgca 

SEQIDN098 

ccacgcgtccgatttcctcggctattttctgcactgactcacgatttttcggacgctttgtt 
ctcgccgtagcgcggatattatacactttgtacaatctctgtagtgatcgccgattgatttg 
ccgctccggtgaagttgtccttgccgaaaattttctctcagatcttgtgaggcaggtggctg 
cagttgttgtaaaggttgaagtagctctagacaaaagcatttgcatgttgaccagatgagca 
gaactgatgttatttgcagtagaaggaggaggtttcttctcgtcttcagctrctggatatag 
taagggcctgacccttctactcttgggtcagaagaacgaagagaagcccatgagagttgcac 
cgtggaaccagtaccagttggtggaccaagaaactgatccggacctccagctggcttccggg 
aagaacagggttgtccgcgggtgcgcctcctttgtatgctttggtcgcgctgccgctggact 
tgagagcccatctccccttaaagtcggtcctacccaacagccagaagtcttgcctagctgtc 
ctgcttctgacaaggacaacaatcagtcgcagtgtgttaatattattgaagacagtcatatc 
tcaccaaaggttgctcttcggagtagcttaaagaaaccagcaaatagtatacccatttctgg 
tggtaatggtaatgaacgcggcacaaattctctaaagattgatgatgcccccaatcctatgg 
agaaaaggaaagtgcagtggacagacacatctggaggagagctttttgagataagggaattt 
gagcctagtgatgatggtgaatcagatgatgaatttgagagtgggaatgaaagaacttgttc 
ttgcaagataatgtaattttagctccttatagaaggtttcatggctgattttcatgtaggag 
gcaacaattgaggtgctgcagatgatattacgtgggaggttggtttcgacacatgccttttg 
ctcaactatcagtcgacacaagtgaattttgaggctttattaggtagcgtcacgttgatcat 
gggcttttctggttttggctgactaatctgttgtctatattcaaattcttgaatgtcagttt 
tgtttctctggggcccgctcgtctattgttttcataattatattttatcttcatttttttaa 
ttaaaagatttctggtcctcttttatccaaaaaaaaaaaaaaggacgaccgctgtagggt 
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ccacgcgtccgagcatatattcttcttcttcttccttttggcttctatttcattagcttata 
aaaaccaaaacacaaacccacaaacaaagaaacgttaattagctttgtggagtggagagacc 
ttttattgaaagggagcttaactgttggaccagctgagcactacttgccaaagatttgaact 
ttcttggttttttggactggaagttagataaagctcaaatcttttgggtttattttgtatct 
ggtacagtttcttgaacaagaatagcaggaccttcaaggctgaaaagggaatattttcggct 
tttttggagtgatttgttggatgatcaggaaattgttttgtatttgagggaagcagttaata 
aggtagaatagaaatgatggatagggtgccaagattgcttatggaggttctaacagaaccgc 
aacgaggaggagagtctttactcgggtcgattaagattgctgttttaccgattgcaaaagtt 
tttaccatgtgcttcttgggatttcttatggcctccaagtatgttaatattcttccagctaa 
tggacggaagctcttaaatgggttggtgttttcacttttactgccctgcttgatattctctc 
aacttggacaagccatcacatatgagaaactgcttcagtggtggttcatccctgttaatatt 
gttatcgccaccatatctggctctattataggttgcatcgttgcttcaatcgtccgtccacc 
atacccttatttcaagtttaccgttgtacaaataggcattgggaatattggaaatgttccac 
ttgttctgatagctgcactatgtcgggataaatcaaatccttttggagactacgagatatgt 
tcgcgagatggaaatgcatacatctcatttggccagtgggttggtgcaatcgttctctacac 
ctatgtgttcaatatgctcaaacctcctagtgaaggcactttcgacgttcaagatgcaaatc 
ttcctatcaaaagtcctaacaaagatggctcgcagagccatctggtagctagttcaccagag 
caagttccattacttacaacagacgtagcaccggctgactcaagtggttcaaagaaagaaaa 
ggttaaagagttctttaattttctatatgagaaactgaagctcaagcaaattattcaacccc 
ctattatagcttctgtcctagccattgtcataggatgtgtgccaatcctgagacgacaggtc 
tttacttctgatgctccactttacttcttcactgacagctgtttgattcttggggatgccat 
gattccctgcatattgctggctttaggaggcaatcttgtcgatggaccaggacccggaagtt 
caaaacttggcctaaggacaaccgctgcaattgtctttggacggctggttttggttcctcca 
accggacttggcattgtcatgttagctgataagcttggattccttcccgctggtgataaaat 
gttcagattcgtactcctccttcagcatacgatgcccacatccgtactttctggtgctgttg 
ccaacttgagaggatgtgggaaggaggcagcggcggtattgttttgggttcatatttttgct 
attttctctatggctggatggatcatcctctacctcaacatactcttttaagttaggatcaa 
acagtgttgctacaaagagtaaaaagaagagatcttgggatggaaggtttttattcctgtta 
ccaggatcgcgccagcctttcgtaaagctgctgtttttagctcattcaattgcctcattgcc 
atttgagactaagagagagatgtattaatattatgtaggaatattacctactacatctataa 
gtataattagtcatgatggagttaaccaattgctccttatttgttcttggcttcttctactg 
tataaccttagcttatgctaccttgaaactggctatgtcaaagttggacttggcatttggca 
gacaaagatgagacatgatgttcattggaagaataagtaaacgttgaacagc 
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ccacgcgtccgagcttaagaaaagaacacttgccctgctgagtatctataaccatatagata 
ccagtaccataattttgcatcatcttcaactctaagggatcaaagctttattgtccaaagaa 
aaaacttatggactcccctcataagccaaggtctttttcacctaacctattctttttcttcc 
ttcttgtatcctcaaatctcctgacttttttcatttctaacatatttaagaactcttcttgt 
tctctataccagcaaacatataaagccattgccactgcttcattaaacaatgctatcccttt 
tgttgttaagtcagaaactagagatgttgttcatgtatctgataaaccagcagctgatttag 
acttaccatctgagttccttgctttcacatctccacatcaactgccatttggagtcaactcg 
agctttaattctgacaaactcatccctcccgttggccgtccatgtactatgtttccagattt 
acttcggcgttacatgtcatacaaaatcaatggatcttgtcctgatgatgagctcctggcac 
agaagctgcttctcaaaggttgtgagcctctccctcgccgtagatgccgtcctgcagctcaa 
caggaatatgttgagccttatcctcttcctgagagcttgtggattactccatcagattcatc 
tgtagtttggacagcatatacatgcaaaagctatgaatgtctaatcaacaggaaaaaaaatc 
aaaaagcatttgatgattgcaaagactgctttgatctcaatggcagagagaaaaatcgttgg 
ttgtccaaaaagggagctggccttgacttctctattgatgaagtactagcagtgaagaagcc 
gggtacaatCagaatagggcttgacattggtggaggcgtagctacatttgctgtaagaatga 
gagaaaggaacataacaatattaacaacttcaatgaatctcaatggtcctttcaatacattt 
atagcatcaagaggagtcgtacctttgtacataagcatttcgcaacgacttcctttctttga 
caatacactggatatagttcactcaatgcatgtgctgagtaattggatacccacaactctgc 
tccacttcttattattcgacatctatagggtgcttcggcctggtgggctgttctggcttgac 
cattttttctgtgttggtgagcaatttgagcaagtctatgctcctcttattgacagcattgg 
gtttaataaggtcaagtgggtcgttgggcgaaagatggacagaggccccgagctgaatgaga 
tgtatctttcagcactgttggagaagccactgaagaactcttggtgatagtattagatgttt 
cttttactttcttacttttgatagttatagaagagaagatagaaggaggtgtttattttttt 
taaaattatagattcatttcagatgacatcttctggcataaactagcagtttgaggtagctt 
gagtatgattttgtaatgttggtgggctaaaccttagagctttagcggcc 

SEQIDNO101 

ccacgcgtccgtgatgctgttcatgtatctgataaaccagcagctgatttagacttaccatc 

tgagttccttgctttcacatctccacaacaactgccatttggagtcaacccgaactttaatt 

ctgacaaactcatccctcctgttggccgtccatgtactatgtttccagatttacttcgtcgt 

tacatgtcgtacaaaatcaatggttcttgcccggatgatgagctcctggcacagaagctgct 

tctcaaaggttgtgagcctctccctcgccgcagatgccgtcctgctgctcaacaggagtatg 

ttgagccttatcctcttcctgagagcttgtgggctactccgtcagattcatctgtagtttgg 

acagcatatacatgcaaaagctgtgaatgtctaatcaacaggaaaaaaaatcagaaagcatt 

tgatgattgcaaagactgctttgatctcaatgggagagagaaaactcgttggtcgtcgaaaa 

agggagctggccttgacttctctattgatgaagtactagcagtgaagaaggctggtacaatc ' 

agaatagggcttgacattggtggaggtgtggctacatttgctgtaagaatgagagaaagaaa 

cataacaatattaacaacttcaatgaatcttaatggtcctttcaatacatttatagcatcaa 

gaggagtcgtacctttgtacataagcatttcgcaacgacttcctttctttgacaacacactg 

gatatagttcactcaatgcatgtcttgagtaattggataccaacaacactactgcacttctt 

attattcgacatctatagggtgcttcgacctggtggactgttctggct.tgaccatttcttct 

gtgttggtgagcaatttgagcaagtctatgctccccttattgacagcattgggtttaataag 

gttaagtgggtcattgggcgaaagatggacagaggccccgagctgaatgaaatgtatctttc 

agcactgttggagaagccactgaagaactcttggtgatagtattagatgtttcttttacttt 

cttacttttgatagttacagaagagaagacaggaggtgtttttattttttattttttatttt 

tttttaaattatagattcatttcagatggctcttcaggcataaactaacagtttgaggtagc 

gtgagtatgatcttgtaatgttggtgcggcta 
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ccacgcgtccgcaaaatccacaacaatctcaaattggattttctaatctgtaatattaatta 
cttcattcaaattatgtaaattcttttgtataaaaacccttaaaaacacaatcttttcatca 
attctcaattgggtttctcttcttaatctgtaagtttttgttacttcattcaattttgtata 
atggattctgatttttctcccgggtgtgggtcgggtatacaatcagactttgcgttcgcttt 
caatgatagcaatttctcggatcggatcttaaggattgaaattgtacccgacttgccggatt 
gtaaaacgggctgtgaaggttgtactggcggcattgatgattgggcccggaaccgcaagcgt 
aggagagaagacatcaagaaagaaaatgatgcggacgtggtcatgcaacgtgaggagcaagt 
agtaaattgtaatgtgcttgaaatggaagatggtcttgctgatgatgaacaagatgaagaag 
ctgtaggaatgcttgaggaatcaccctctggcattgagatgaccacaaatccccctggcgat 
gatgaagcttctaaaagcgatgatgattcatctacaaacatggactcttcaaccccccttcg 
ggtgagaactatacatatcagttctcccattttggcagctaagagtccattcttttataagt 
tgttctcaaatggcatgaaagagtcggaacaacggcatgtaaccatacgaatcaatgcgtcg 
gaagaagctgccctcatggacctcttgaattttatgtatagcaatactttatcaactacaac 
ac t cact gccgt get t gatgtgttgatggct get gacaaatttgaggttgcgt cat gcatga 
gatactgcagccacgtactgcggaatcttcgcatgacttgtgaatcagcattgctttatttg 
ga t ct t cc t tccagtgt act aatggctgatgcagttct gccgt taacagatgctgcaaa a ca 
gtttcttgctgcacgtttcaaggatataaccaagttccaagaagaggtattgaatttgcctc 
ttgcgggaattgaggctgttctgtccagtgacgatcttcagattgcttcagaggatgctgtc 
tatgactttgcgttaaagtgggctcgcatccattacccaaagcttgaggaacggcgggaagt 
attgagctcacgtctttgtcgactcattcgatttccatgcatgacatgcaggaagctgaaga 
aagtcctaacatgcaatgattttgatcctgagcttgctacaaagcttgtcctcgaggctctt 
ttttataaggccgaagcaccatatcggcaacgctccattgctgcggatgcagggaatgcttt 
gtgccatcgttacatggagagggcatacaaatacagacctgttaaagttctcgagttcgaag 
cacctcgtcaacagtgtgttatttacctagatttgaaaaaagaagagtgtgctagcctcttt 
cctgctggtagagtttattcacaggctttccatttgggtggacagggatttttcctgtcagc 
tcattgcaacatggatcaacaaagtgcattccattgctttgggctgtttctgggcatgcaag 
agaagggggcagtgtcatttgcagtcgactacgagtttgcagttcgtaccaagccaaacgag 
caatacatgagcaaatacaaagggaactacactttcactggtggcaaggttgttggctacag 
gaacctgttgggtgtaggttggagcgcgtttttggctgacgatagtgcttacttcatcaatg 
gacttctccatcttcgagctgagcttactatcagccaatagagagtttaaatactatcgctg 
tgcttctgctgacagctaaactatactttttacttcagtgaggccttaagaagtttacattg 
tagtggcatcttacttgaaagtgcagcacatgtgagcaatagttgtatgggctatatattgc 
ttgttacctattggcatatatgcactggtgtaaattagtaaaatcagtctttgagcggttca 
tattttgacaatcacagtctttgttaagagttctagctgccc 
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ccacgcgtccgcccacgcgtccgcgctgagcgtttcaacgacttagcttctgttgattctga 
tcaactgctttcgatcccgtggattcgtaaactcttggatgttttcctctgttgccaggaac 
aattcaggtccattgtgtttaacaacactgcttacttgaataaagctcaatggaccgttaca 
ttactgattatttcgataggagtgtgaagggtttggatgtttgtaacgcgataagggacgga 
attgagcagatcaggcaatggcagaagcagatggagattgttttgtgtgcattggagaatca 
gaggagtgttggtgaaggccaatttcgtcgcgctaagaaggcgttgattgatttgactattg 
gtatgctagacgataaggattctaatgcaactgttaaccatagaaacaggtcattcgggcga 
aacaatactcagaatgatcataggtctatggggcattttagatcgttatcgtggagtgtatc 
gaggaattggtctgctgctaagcagctccaagcaattggtaataatttagttgctccgaaaa 
gtaatgaaattattgctactaatggattagctttggctgtttttacaatgagttatgtgttg 
tactttgtaatgtgggcactagtggctgcaattccttgccaagaccgcggcctgcaaacaca 
tttttatgtgactaggcaattcgtttgggccggcccaattttgtgtcttcatgaaaggattt 
tggaggaatcgaagaagagggatcgtagaaatgcttgtggattgttgaaggagattcaggag 
attgagaaatgcgtgcaccaaatgaacgaattgatcgatactgttcagttcccaatcacaga 
ggaaaaagatggagaagtaaaggaaagaattcatgaacttgggcttgtctatgatggtttaa 
agagtggattggatcctttggagcgccaggttagagaagtgtttcataggatcgttcggagc 
aggactgaaggccttgactctattggaagatgaaatcatgagtgacaaatttgtgtagaatt 
gggtgatggttgtcttttgagaaggcatctattattagaacatgagcataatatgatataga 
ttttccccttttttttcttttctttgtttgaccctttttttagatgagaagaggagggagaa 
tggttaatggtgtaatgtgcctaaagaataagtagtttaaagaggtgaaaatcatgtatttt 
cactttatatatgtaaagaaattaggaaaaaaggtggatctttggtccatttttggtgatgt 
tcatcttgtttggaattgtataatcacatttctgagttaagttgctttttaaaaaaaaaaaa 
aaag 

SEQIDNO104 

ccacgcgtccggttttcttctcatcccaaatcgcactctagggttacgccgcctctatcagg 
aaatcatgcctcgccgaagctctggaagatctgctcctcgtcctgcccctcgtgcggcccct 
cgtcctgctccagctccagtacaccatgctcctccaccagctcctatgcaaagtagcggtgg 
tggatccatgcttggtggtattggttctaccatagctcaagggatggcctttggtactggaa 
gtgctgtggcacacagggctgtagatgcggtcatgggtccacgcaccattcaacacgaaact 
gttgcttccgaggtacctgctgcagcagcagctcctacaaccatcggtgctgggtctgatgc 
ttgcagtatgcactctaaagcgttccaagactgcatcaatagctctggaagcgacattggca 
agtgtcaattctacatggatatgttgtccgagtgcaggaggaactcaatgctgaatgcttaa 
gcttgttgtgtctcattttaataactttgaactcattcttaatctgattgttgaaacagcga 
tggaattatgacaaaaggcttggtggtattgatggagcaagtgaatttggttcttgatacac 
ttttgggtcaaataatttatgctgaaatatgaactttatagacttctta 
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ccacgcgtccgcccacgcgtccgcccacgcgtccggccttggctctcactttcaaattcccg 
acctctttctagcgccgaaattaccagcacgcagagcaaggacaccattagccgttcggcca 
ctgacccaaatggaaatcggcggattagactccgatggccgtgaatttagaaacgcggacga 
gatgtggagagaagaagtaggagatggtgaccaccaaaagaagtctcaatggtataacaaag 
gcatcaattactgggaaggtgtggaagccacagtggatggtgtgctgggcggatatgggcat 
gtgaatgaggctgatataaaggcaagtgaggaatttctcaacaccattttgccagaaaggtt 
ccctgatgctggaagaggccgccatcttgtagctctggattgtggatctggcattggaaggg 
ttaccaagaatcttcttatacgatatttcaatgaggtcgacctactagagcctgtatcacat 
tttttggaatcagcccgggtaaatttggctcctgaaaatttaatggtgtcagagttgcacaa 
agctgccaatttttattgtgttccactccaggaatttactcctgatgctgaaagatatgatg 
ttatttgggttcagtggtgtattgggcatcttgcagatgatgactttatttcattcttcaag 
agagcacaggctggcttgaaacctggtggactttttgttctaaaagagaacattgcaaaaac 
aggatttgtattggacaaagaagataagagcatcacaagatcagattcatattttaaggagc 
tgttcaatcaatgtggactatacatctacaagatgaaggatcaaaaaggatttccagatgaa 
ttatttgctgtgaagatgtatgcattgactactgagatgccaaggcaaggtaataaacctag 
acctaaacggacaactaatagacctgctatcatcagatgatgaatatcacattggtgttg-tg 
tggttttactaactttggatgaagtaattcataggttattgtttttaggtcacatgtatgcg 
agttctgtcaatgttatgttattgcttttggatataagttatatacattgatagtgaagaga 
tttgttgtgtactttagcttattgtaggttacttcttatgttgaattatttatgcaaccgct 
tttgtatcaatgtattctgctcttcttgtaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa 
aaaaccaatttaaaggtctgg 

SEQIDNO106 

tttgtacaaaaaagcaggctggtaccggtccggaattcccgggatatcgtcgacccacgcgt 
ccgcccacgcgtccgcaaggcttagggatgacgttgcccgataggaatgctaagaaaaaggg 
gaagcaaaaggcttccggtgagtcaaaagactcgcacgtagctgaagctcttgataagctta 
gggaacagactagagaggctgttaaaggattggaatcagtggccgggccgagacctggtgta 
gatagcttggggaatgatgcaatgatggaggagtgggttaagcagtttgaggagctttctgg 
atctcaggacatggagtcgatagtagagaccatgatgcaacagcttttgtcaaaggaaatcc 
ttcatgaacccatgaaagaaattgaagaaagatatcctaaatggttggaggacaacaaagct 
aagttgagcacggaagattatgaacgttacagacgccagtatgaacttataagagatctgaa 
caaagtttacgagactgaacctagcaacttcaacaaaattgtagagcttatgcagaaaatgc 
aagaatgtggccaaccgccaaatgatattgttcatgagcttgctccagactttgatatatca 
tctcttggacaactatccccagagatgttggagggccaacagaactgccgtgttatgtgaaa 
actgaaatgtccccccgcttgaatgtcctgcttgttttcgtcacctttgtcacagtttgcat 
acaacatttattttgct 
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ccacgcgtccgctatacctagatgacatttaccttagcccttaagccaaaaaagaaagaaaa 
gatccattgcctcatcctctgtaatctcatggattcactggtttcaatcctctgcatcttct 
tcttcttcaatataattcttaccccagttcatgctcaagtgatctttgaggatggttactca 
gttaagacactgattgatggccacaagatcaaaattaaccctcactccataatttctgtaat 
gggtgctggcaatttcatcattcttgattctgctgccagtactttttacaccttatctttca 
acaaaaactctgaattttctatttcgaagttaactggtagtgagactgctggctatgtggat 
ggttctctggataaggctaagttcaacaaacccaaaagctttgctgttgattcaaaagggaa 
tatttatgttgctgatatttggaacaagcatgcaattagaaagattagcaagtcaggtgtta 
ctacaatagcagggggttattcactaaagccaggccgtgctgatggacctggattaaatgcg 
tcattctcagctgattttgaactttcttttgttcctgagagatgcactttaatgat'ctctga 
ccgtggcactatgttagtgcggcaaatacagcttaaggccgaggattgttcaagagattctc 
attctgctctaagagcagtttctacatggttcttaaccgtggggcttccctgcttggtctgc 
ttgattctcgggttggtcatccgtccttatgttatccctaatgaacatggcagtcgtcttcg 
gcgcaacatgacatggaagcacttcctaatcagtctggagagacaagttctgatgttctgct 
tcggcatcagaagcgtagttgttgactcaaagatctattcacttttaaggaagctcgtgtta 
cttactttctcccatctgtgcctaatgtttagtcctaaagtagtagtatgccagacttctcg 
taaacaactggctcctctcttaagttttgacgactctgaaagcaaagaatcagcaaaatcac 
cggtggcagctaacattttggaggatttgataacttttgatggaagtttggttaactccgag 
ctgactactaatcaagatgatgcagtgagcaaaagtaccgatgtttctgttgtagatagcat 
gatactagctaatctaaaagggtttgcagaacaggggattgcttcttcagggcgtgaagttt 
catcgagcatttcgagcttagttaaccgaaaaaagaacgtaacttagtgaagtctagcagta 
gtatgtattactattaacttttgcaactgttctgaaagttcatcggtctatctgctaccact 
ttcatgtacatagtggaacaagcaaatgactcaaggcccttttgagttaatatttcctagcc 
tgtgttttcttggttccaaaaaaaaaaaaaa 

SEQIDNO108 

ccacgcgtccgcccacgcgtccgcccacgcgtccggaaaagaatcgcagtttcgaagctatc 
agaaaatcccaaacaacaaccatgtcttactacccaaaaggctaccacggggaagatgacga 
cggagctgaattcgacgagtacgatccaactccgtacggcggtggatacgacatcgctttga 
cttacggtcgtccgcttccaccctctgatgaaacctgttatcagacttcttcagcttctgat 
gaattcgactatgatcgtcctcagtactcttcttatgctgagccttctgcttatggtgagga 
ggctcttgagactgagtaccaaagctattctaggcccaaacctcggcccactccttcttatc 
atcgcccatctgaggaagaaggcgaagcttatgagcagcctcaggccgattatgggtttcag 
cctgggatgaatcgtcctggcagtggatatggtggggaaagtgaatacggatccgggtatgg 
acgcaagagcgagtatgaagaacccgcttccgaatacggatccgggtatgggagaaagaccg 
agtatgaagaacccaaaccggaatatgtagaacccgcttccgaatacggatctgggtatggg 
cgaaagagtgagtatgaagagcccaaaccagaatatggatccgggtatgggcgaaagagcga 
atatgaagagcccacatcagaatacggatctggatatgggagaaagagtgagtatgaagaac 
ccgctccagagtacggttcgggatacaggaggaagagtgaatatgaggagccaagatcggaa 
tacgggtcgggttatgagcgtaggaccgagtccgaagagtatggatctggtggatatggaag 
gaagcccagctacgggcaggaggaagagggggagaggaggcccagttatgggcgttcaagct 
accagactgaggagggagaagggtacgagaggcctcgctatggaaggtctgaggaggaggac 
tacaggaagcctagctatgagaggcgtggtgatgacgacgacgagggctatggtcgcaagaa 
atatggtgatgacaactccgatgatgacgaggagaagaaacatcaccacaagcaccaccacc 
gcaaacactatgatgattgagcagtgtgctttaatcatctgaaccagatttatgccatacta 
agaactattacaaaataaaagttggcaagtttgagatacattttgtttgtgaatgtttgcta 
tgatggctggactgtccagttatttatgtgatgtattttgctcttctgcaaatcccagacat 
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ttgtcagggttagtatgccatgaatgtgtgaactttatgatcatgatgactcttttatctct 
taaaaaaaaa ' 

SEQIDNO109 

ccacgcgtccgcccacgcgtccgcccacgcgtccgcccgagccagacgttaaacgacgtcac 

tttaatgtaccctttccccaaaaattggggctttgtaaattcatttgaacaattcacaaatt 

gtagaatttagggttctttttcagtaatggaaagtgggatttgtagtcccacaagatgaaaa 

gcagtcaagaagaagcgagccgcggcggcgcagcagatgaagaaagtgtgaaatttcacagt 

gttgtacagccacttagggatttagaatccaactggggtgttgatttagccaaaaatcttga 

agaatatttgctcaaaatttgctctggtgaaattactagagataattatgatgatggtcatg 

tgaattttgctgaagctgcattgctgcttcaggggtcagttcaggtgtacagcaggaaggtg 

gaatatctgtattctctggtattgcattgtttggaattcattaccaagaagagtgaaccaga 

tctaccagcaagtgtatcagcccaagaagatgaaaacggtttgcctgctgccgacaatgaag 

agaatgatccatactgggcttcagaagaaacctcagtggaagcaaagaacatgttggataat 

acgacgtgcagggattcttcatttacccagtttgtgaaggcccctgcaaatctggttgtacg 

cgaggctgactgcttggatgttactggagatgctggagaactagagtcttacctgctagcca 

cgtgtgatctttaccgagattttattctgttggacgcatgtgatgccgtaacagtggatgag 

tttctgaataatgagaatatagctggaaaggtgctgaacaatagctgcagtgcagagggcct 

ttctttggactccaagtgccacaagagcttttactctcccacaagacgttttgagggaactg 

gcaataagtcttcagctcaaaagaatcaggatgctaatttatatcagtctcaagggtttcat 

gagtttggtccaggcaattttaacaatgatcagttcgcatctgatatgcctgattacatcga 

tgatgcacatagatgtgaagatggatattcagaacctagagactcagacgaatcggatgatg 

aagacccatggaatccgttgaacccgcatgaacctggcactttgaaagtaaaaccatacaaa 

aaagttaaatttaatagaaggcagggtgcggcgtccaaaaaagttgcatctttggctacaga 

atttccagttgcgagattacatggtaccactagcgcagacctcaacgacatgtgggagagaa 

aatgttgtgccatgaaaaaacaaggcgactcacaatctcctccaccatatgagaagctccgg 

gaatcacttcttcatggggagaacaacgattatgatggtttggatagtccaaaggaaaagaa 

tgaaaatgatgactatgatagtgcagatcacgattttgggccttctgcctttgacatgccag 

aaaatgctgacatgaacaccgatgcaactccttatggggaaaagcacgataaatgtagtcca 

ttttttgacagtgaagctcatgaagattcgaatgctcatgccaaccttgaagatctttgtcg 

ctcccacttggattctcttcttgctagccttgctgaaactgaaaagcagagtgaattggctg 

cacgggtttcaacgtggaaacagagaattgagcagaacttggaggaacaagaatcacatccc 

ccctttgacattcatgaatatggggctagggttttgtgcaagttatccctggaagaaaatgg 

tcaaagcaccaagtctttttctgatgttgtcacgggtcaagagaagcatgatattgctcgaa 

cattttctgcgcttctgcaattggtaaacaacggagatgttggtttggaaagaggtggaata 

cgtgagtccacttgttacacagctgcaaatcccttctatgtccggctccttaggaatgataa 

tggtagggagaaaatgcagattcggtcatcaagaaagagagcaaaatctccaatacccaatc 

agggctttagaaaggaaaaaaacaaaggtaaagaagttcaggctgctttcagttcatcacct 

tcagaacccaactcaaggttaccgatttgccctgaagctgggaaaggttaatggaactcgtt 

gtacgcctgaaggtaagaaaagaaggaaatccagattagtcgtaccaccagatatacatact 

gcattgtgatatacattttgctctagttttcaagtaagcctctcctctctcgctattcggtc 

tcactgtgcccgttgtatgtgagagactcaaggcagtaattctgtttgagtgtagtaagaca 

gaagattaaccccaccatgaccactgtaattcttatcacaaaccaacaacctgttggctgca 

gaaatttgtaagatgtgtttattcttaacttaattaggacttactaatagtttggagcaggg 

aggatgtaacaatattttgacatagtgcagagctactcatcatagctc 
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ccacgcgtccgcccacgcgtccgccaaaatccatcacgaattgcattttcagatacgtgagt 

caactgctaatgggagaacacttggctctatgtgttgatcgtcttatcacacctaaatcttt 

gcactcgttgcaagggtcagaggatgcaggatcctctgcaggaagttcttgctcgcacacag 

taggtcaatcaccttatggtactactaataaggaggatgaagaactagaagctggaggtgaa 

gatgagccattacttcagactgtggaatgccgaatttgccaggaagaa-gatagcactaagaa 

tttggagattccttgtggctgcagtggcagcttaaagtatgctcataggaaatgtgttcagc 

gttggtgcaatgagaagggtgatataatttgcgagatttgtcatcagtcttatcaacctggc 

tatactgctccaccaccgccttctccttctgaagatattgccatcgacatcagtgggggctg 

gacagtggctggtactcagcttgacttgcatgatccgcggcttcttgcgatggcagctgcag 

agcgccatctcttggaggctgactatgacgagtatgctgattcaagtgctagtggagctgca 

ttttgtcgttctgctgctttaattttaatggcccttctattattgaggcatgctgtgaccat 

cggaaatgatgatggagatgacggtgatgtctccacctttttctctcttttcttgctccgtg 

ctgctggttttcttctaccttgctacatcatggcttgggctatcagtatcatgcagcgtcga 

aggcaaagacaggaggcaacagcacttgcggcaacagaagttgctt teat get gcaggcagg 

gcaacataggggcttgcatgtaacaatagcaccaggacctgcacagttagctgaaccttcag 

caacaccagcacacccaactactcatgttgcaacaccaaccgcccaggcgacatcccctcct 

ccagagatggtataaatggctttgctcagtttgcttgttataaaatagttgccgataggggc* 

attttactgttggtaagttgcacaagatggggatgagtagaagggtagaggagtattctttt 

ccctttttgctttttcgatttattagctgtatctttgcattgccaaatttggagtgcagagg 

ctgaaactttttccatttgttcaatttttcattaatgcttgaacatgtaaaaatataatagc 

gaacttagctgctttcaatgtggagataccatatcttcacatcgtgtacattgtttatatat 

taccattatgggttactcttaaaaaaaaaaaaaa 

SEQIDNOlll 

ccacgcgtccgatgagacttggagggtgtcctctggtttagctgaggcatggcgagacaaca 
caaatgttgcatccaagaaaaagtcattttccattgaaactgaaattgatgatgaggcgact 
agttatgcgtctttgaacgaggacggtcatgactttgatgagattgaggatatgaggatacg 
cgggaacttgttttacaagcttgataaagattccaaggaatacgaagaatataagtttgaat 
tccatagaaggaatacgaacaagaataatggaaatgacggtccaaaagagaaggaaaaatcg 
aataacgtttcagcttctagggtcgagaaaggtctaaagggtatagatgagaagcagcaaaa 
caagaaagagaaactgagctataactctgcctctccgtttcagaattttcagctaaatgatt 
tcggagcatctccaataaagaggttaagggttccaacttttaatcagcttactgccccttat 
catgagccgttttgtttggatatttatgtgtcgaaaggttcagtaagtgctagcattatcca 
cagagctactagcaaggttgttgttgtggcgcactctatttcaaaggacatgaaatttgact 
tgggatcaactaagaatagagctacttgtgctgctattggggaagttctggctcaaagagca 
ctggctgatgatattcataacgtagtttatacgccaaggaaaggggagaaattggaagggaa 
acttgagattgtacttcagtccattattaacaatggcatcaatgtgaaggtgaagattaagc 
agaggaaaaccaagaaacctggcttccaccgcccgacagcttaggtggtcatcctacattac 
gtaggatgaaattaaaagtgacaaggaagttttatcaacgtcttataagctcgaaacagcgc 
aa tgtagtaagtagaacaaggtcagagatgtatt act acctcttttgcgaggttgcagaaca 
tttccctaaattcagtctttaaatcggtttcaatagtagttacaaacttgggaataaatctt- 
ttatttcctgcaatttgtattctctttatgagaatacattgctgttaatgtaaaagtgtgac 
tegcag 
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ccacgcgtccgcccacgcgtccggctggaactttgctgtatcatcttcaaactcttgattag 
tt at attaaagactagtctttaaactcaatgggt gat agtcagt act ctttctcact caeca 
ctttcagcccatctggaaagctggttcagattgaacatgcattgactgctgttggatctggt 
caaacttcattagggattaaagctgctaatggtgttgtaattgctactgagaagaagttacc 
atccatcttagttgatgaagcatctgtgcagaaaatacaggttttgacgcctaatattggag 
ttgtctacagtgggatgggccctgattctcgagttttggttcggaaaagtagaaagcaggct 
gagcaatatcaccgactctataaagaaccaatccctgtcacacagctggtgagggaaactgc 
tgctgtcatgcaggaattcacccaatcaggtggtgtaaggccatttggtgtttcactcttgg 
ttgcgggatttgatgacaagggtccccaactatatcaggtggatccatctggttcatacttc 
tcttggaaggcttcagctatggggaagaatgtctctaatgcgaagacatttctcgagaagag 
gtacacggaggatatagagctcgatgatgctgtacacactgctatactgactctaaaggagg 
gattcgagggacagatctctggcaaaaacattgagattggcattattggaactgacaaagta 
tttaaagttctcacgccagcggaaatagatgattacctacaagaagtggaatagattttctt 
ttccgcttaaggcattggaaaaagttgtcaggttggaagcgcagacggggtcatagcacaac 
tattggatgttcttgttggcttgattatcacttgactttaatcaaactagacttagttgtat 
gttggccatgttgtggttatttattgcctgatgtatggctctgaaaagttatatgggttttc 
ttttctcagtttcttgaacatactgattgttctatgttacctgaaacacatgacagtagaga 
aaagcattatattatttgagcaaccctcttacgtctgagaacgg 

SEQIDN0113 

cgtccgaaatggcggaagacaagaaagagtcaacgtcgagttcgccgctccaagaagatccc 
gaagatcccgtcaaatcccctccttcttcccccaattcctccactcgcaaggcttgctatgc 
tgttcttcaaagttgggtgtcaaagaagttcatgactggatgcgtggtcctcttccccgtgg 
ctgttacatttttcatcacttggtggtttattcaatttgttgatggtttcttcagccccata 
tatgaaagacttggtattgacatatttggccttggatttgtgacatcgataaccttcatatt 
ctttgtcggtatttttgcttcatcatggctgggttcaacagttttttggataggggaatggt 
ttataaagagaatgccctttgttaagcatatatactctgcatccaagcaaattagttctgct 
atttcaccagaccagaatactaatgcattcaaggaagttgctataattcgtcatccccgaat 
tggtgaatatgcgattggtttcataacatcttcagttgttctccagagagatgatggggatg 
aagagttgtgcagcatttttgtccctacaaatcatttgtatataggagatgtatttctggtt 
aattcaaatgatatcatcaggccaaatttgtctgtgcgagaaggcatagagatcattgtttc 
tgtgggaatgtcaatgccgcaggtgatttctcctatagaaaggatcacacgacagaccgacc 
ggatccctctaaacagaatgttaaagtaaacagaatcatcatctcatttgcttctggtttgc 
gctaagctaccataatctcatttttttagggaagtcgcatatgtatactgttggtcttctat 
gttcatttgatggttcagcagatctgaactggagcaattagcaacattggtgactttgttgt 
gtgtttattctttaggattagtaggaggagttctgtttgtcggaaacaaataggtagggagg 
cattgtttggctgtagctggtttactctaattaacatttcaccgtggtctgtacagtcttgt 
aacttatgagttcttgtgtttgtattataaagaggctatcagtgttatcgc 
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ccacgcgtccgcccacgcgtccgcccacgcgtccgcccacgcgtccggcagcctacagtcca 
tattcacgtgctacatcccctgccccaactttggggcatgatgqccagctctatggatcaca 
acaataccactatccgtatttccagcccctccctccaaccagtaattcgtacactactccag 
ttgccctgccaaaaggt gaga ttgccacctct get get get get gaccatgcatcgttgtct 
gttgattctgctaatggaatttctaatggcattgccaatggtggtgtaaagggaaatgctgg 
gcctacgcttgtgaggcctgcattccagaacccatccgtaaatgctaatggttcttatggac 
ggggtgcgttgcctggaggagctgcttcaggttatcacgaccctagattaggttttgatggt 
gtgcgatctcccattccatggatagatggatcaatgttcactgacgggcaaggtaggctagt 
gtcgagcaattcttttacaccatctttttcaaatggcagtgccgttccatcatcaaaaaatc 
agaatgttcatccgcatctaatgggcttccaccacccaaggccctcttctggcatgaacaca 
acaaatgggtatatgaataggatgtaccccaataaactgtatggggggcggtattgtaacac 
attcggtactggcatgggctttggatccaatggatatgatacccgtaccacaggtcgtgggt 
ggatgacggttgacaacaagttcaaacccaggggtagaggaaatagtttctacggtaatgag 
aacatggatggtttaaatgagctcaacaggggacctagaggtaaaggtttcaagaatcaaaa 
gggttttacaccagtaacgctggcagtcaaggggcagaacgttccgctcaccctaaccaatg 
atgctgagaaagaaaaaccaagcctgattcctgacagagaacaatacaactgtccagatttt 
ccagtgacatatactgatgccaagttttttataatcaagtcttacagtgaggatgatgtgca 
caaaagcatcaaatataatgtttgggctagcacaccaaatggtaacaagaagcttgattctg 
ettaccaggaggctaaacaaaagtctggtggttgccctgtttttcttttcttctcggtgaat 
acaagtggtcagtttgtcggtgttgcagagatggtaggaccagttgatttcaacaagagttt 
ggagtattggcagcaagacaagtggatcggctgctttcctgtaaagtggcacatcgtgaagg 
atgtaccaaacagcttgttgaaaaacatcacgctggagaacaacgaaaacaagectgttacc 
aacagtagagatactcaggaggtcaaaatagagcagggcctacaggtgattaagatatttaa 
ggatcatattagcaaacagtgcatccttgatgattttgagttctatgaggatcgtcagaaga 
gaattcaggaaaagaaggtcaagcagcagctattccagaagcagtcgcaggtatgggaaggc 
aaagctactgaagagaagaagaaagaaaacacgaaagtggaacctaagtcccagaaaccttc 
agaagttcctgctggtttgaacaaggaaagtttacccgctgctccgactaatggggaggtga 
agcttacagaaaatggatcagttacaaagggagatgatatgaagggtgctaaaccagtcact 
gtagcggaaaagaaacctgtagctatagggatagcaaaaggagttgctaatggatgctagct 
tcacctaatgaagggggaggtctgtggttaaagaagccctaaattggagcttgttgactaca 
tgatatgcacgccagtgcttggttagatctcataaccattggactgccccttttatcctagc 
tgcatttggagttggttcttgcattaagaaatccccggagataaatcaatagtggcaaggct 
agttcaatctgtttctaagagttcaggaagtatggaagctccattttccctcaggttttagc 
ttctgacaggtttcataccttggtttgggttttaggataatttttttttataattttgtttt 
cgtcatgtggcttattttggtcaattttcccctttttttaaaagttatttgggttttaaagg 
gtggggttcttgttattattagtttggctcccaatcctatcttgtaaatctagatcaatctg 
ttgcggcagttcccaacattgcttttttgtactaatgattgagctagaagctagttttaaat 
gtcaagtctctaccgg 
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ccacgcgtccgcccacgcgtccggcgggatttgtgagtatttgaatgaggaaaaagggtgtc 

cagtgtgatgagatatgagggttaagattgtaggaaaatgggttgtgtatttgggaaagaga 

tttcatcttctgagacacctaatggggaggttgtagttggaagtaggagagaaaatggggta 

gatagagatttggctgccccatctgggaggagagagaaagttggtactgtaaataaagtgga 

tgccgtaggtggcggcggaggaggtagtgatgttggtgaagttgtgaatggtagggatcaga 

aggatgagaagaagggtgagaatgcaaggcataggggtgagaggagaaggtctaagcctaat 

ccaaggctaagcaacccacctaagaatgtgcacggcgagcaagtggcggctggatggccgtc 

atggctttctgatgtagctggagaggctatcaatggttggattccgcgcagggcggatgcat 

ttgagaagctagctaagattgggcaaggtacttatagcaatgtctataaagctagagataat 

ctaacggggaacatcgttgcactgaagaaggttagatttgataatttggagccagagagtgt 

gagatttatggcaagagagatcttgattttgcgccgcttggatcatccaaatgttattacgt 

tgcaaggattggttacgtcaaggatgtcttgtagtttgtacctcgtgtttgattatatggat 

catgatttagctggacttgctgcaagccctggaatcaagttcacagaggctcaggttaaatg 

ttacatgcatcaactattagcagggctcgaacactgtcataaccgtcttgtgttgcatcgcg 

atataaaaggatcaaatcttcttattgacagtgggggagcactcaagattgctgattttggg 

ttggcttctttctttgatcccaataaaaagcagcccatgactagtcgtgtggttactctatg 

gtacagaccaccagagcttctacttggagccaccgactatggtgttggtgttgatctttgga 

gtgctggttgtattttagctgagctattagctgggaaacccattatgcctggtcgtacagag 

gttgaacagctccacaagatcttcaagctatgcggatctccgtcagaagaatattggataaa 

gtcaaggcttccacatgcaactatattcaagcctcagcaatcatacagaagatgtatagcag 

aaacttttaaagattttccgccttcatcgttgccattgattgagactcttctggccattgat 

cctgctgagcgtcagacagctacaactgcattacagagtgcattctttactaccaaacctta 

cgcctgtgaaccttccagccttcccaaatatccacccagcaaagaaatggatgcaaaacg.gc 

gagacgaagaatctcgaagacaaagagctactgggaaagctaatgctgatggtgtaaggaga 

aatcgtcaccgtgatcgagcagtgagggcaatccctgctcctgaagccaatgcggagctgca 

agtcaatatcgataggcggcgtctagtaacacaagcaaacgcgaagagcaagagtgaaaagt 

ttcctcccccacaccaggatggaacattaggttataccttgggttcttcacatcacattgat 

ccagcctatgaaccctcagaagttccattcacatccattaatttctcatattcaaaagaacc 

gatccaaacgtggtccggcccattggtggaacctgcaactggtgctccaagaagaaagacaa 

agccatcaaagaaggattctaacaagaaaggaaaagaaagcctgtaaagtctataatgaacg 

acgtgattctacaatggtatacttcaaggagctgcaaaacttacagattatttgtcctatac 

gtaaatcaagaagcttctcaacagcatagagaggtaaacaagcatttttatcgtagtattct 

cctttgtattcttttggataatgagaatcttttcattattgtacatgtaaattttgtttctt 

catattagcaggctctgtttagatcaataaaatcgtaacgctc 



BNSDCXID: <WO 030851 1 6A2_L> 



FIGURE 4 (continued) 



WO 03/085115 



71/140 



PCT/EP03/03703 



SEQIDN0116 

ccacgcgtccgccgttttccaacttccaatgcgcggcaaaccctaatcctcagctttggttt 

ttgcctcagaaaattcatccgtcaatttgacctctattatggggcgcagtgactctagatca 

cctgccaggggtcgtggatctcctcgtaagaggagcccttcacgcagggaaaggtcacctgc 

tcggaaaaagagttcacatgctgcaagttcagctgtagcagagaagccttcaaaccgtaata 

ggtccccgagacgtgcaaggtcaagatctcttgttcctctttcacctgcaacagagaggcca 

tctagtcgcaataggtccccaaagcgcagaaaatcaatctcccctgcatctcactccccagt 

cagagagaaaccctcgagtcgcacgaagtctcccaaacgagctaagtcaaggtctcctgatt 

cgaggttgttacaggtagagaagtcttcaggccgagtcaggtctcctagacgtgccaagttg 

cagtctcctgaatctcgctcaccctcaccacgaacaaaaagactaaggagagcagaacaaga 

gactgaagaaaagacaagggggcgcgagcctgagaaaaaccatgggagagctagtggtaggg 

ctgctctacatagggagaaggattctgatagaacagtgcctgaatcccgttcaccgt caeca 

cgaacaaaaagactaaggagagcagaacgagagactgaagaaaactcgagggagcgagagcc 

tgagaaaaatcatgggagagctagtgatagggctacacatagggaaaaggattatgacagaa 

cggtgcttgagtcccgttcaccgtcaccacgaactaaaagactaaggagagcagaaccagag 

actgaagaaaagttgaagatacgggagcccgagagaaatcatggaagagctagtgatagggc 

tacacataaggaaaaagattctgacagaatggtgcaaaatgaaaggagagagaaaagatcag 

gaaaggatgcactggataatggatcttctaagtcaagaaatggtcgatcagcttcaccttca 

gaacgtcagcataggagtcggcacagatcgagatcacctgcagcagcggacacgagagcacg 

cgatgagatgacaagctcaaggagaggtgaactcaggaatggtgatgatgactccttatcta 

aaatgcaggcggcagaggaggccttgcaagctaaaaataaagacaagccttcgtttgagctc 

tctggaaagcttgcagcagaaactaatcgagtaagaggtataacacttctctttaatgagcc 

accagatgctagaaaacccgacgtacgatggcgcttgtatgtttttaagggtggtgaagtcc 

ttaatgagcctctatatgttcatcgccaaagttgttatctttttgggagagaaaggagggtt 

gcagacattcctacggatcacccatcttgcagcaagcaacatgctgtcctccagtacaggca 

agttgagaaagacaatcccgatggtacttcatcgaagcaagtaaggccgtacgtaatggatc 

ttggaagcactaatggtactttcattaatgaaaatcggattgagccccagagatactatgag 

ctattagaaaaggatacacttaagtttggtaatagtagccgagagtatgtgctgcttcacga 

gaattcagcatgatgagtctctaaaatggttgacggaggtgtcatttgcattgattggcttt 

gacgtcagaagctttatcagatcaaatatttgctgtgccatgttactagcaggatagccgtt 

gtaagtgcttagccgaaatcgtgtaatgtggtagagatttgggcattgcttgcaaagttttt 

cactgctaatgaaaattttggtttatgcatcagtgatttatcctccagtttgtttataagct 

ctttgtcccctatatatgggatatgttattgttgattaggtcttaacttgtgaatgtgcgct 

cttttcttctaattattgaagatgctggagtgccccc 
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SEQIDN0117 

attagatgcgtttggctttgactcaagcccttgaatcaaaactgcaatcaccgttgtggcag 

gttcgtgtgaaagcaatctgtgtcctcgaggctatcttgaggaaaaaagatgacgagcactt 

tggtattatggcatcttatttcaatgaaaataaagatgttgtggtgaaatgctttgaatctc 

cccaagcgtcattaagagaaaaggcaaacaaagtcttaagtcttctgaatgatggacaaaca 

gctgattctgtgcctcatgtagataggtcagcaatggctggtgcccctgttgttcagatgcc 

cgacttgatagacacaggtaattccgatgatctgtttggagcagatgatttagcaaatatgc 

agagtggtgaagggataaaaattgcatccacctctggcgcccctctggttgatgatctattt 

ggagacaatttgggtggcggcgtggcttccggccagcagaaaaatggtgatgacccctttgc 

tgatgtctcatttcacaccagtaatgagaaggcgcttgaagctgatcacttttctggaatga 

catttgataaaacagatgctactgaagtccatttggctgtcgatagaactggacctgaactg 

tttgacatgtttggtcccaqtgttgaagttccccaggatcccaataatcctagaaaggagat 

tcacgatttaatgaatagtctctctttgaatgggaatgactcatctaagaagcagaatggca 

gctcaaggggaacctatccggatatgtttcaagagtccactattgatcctcatcaggcttcg 

aatgatgccttgaacagcatattttcctcccaggctggtggagcaaattcaaatcccatgtt 

tcctttgggtgctatgcagtataacttgcctcctggcttcgtattgaatccatcatttgctc 

ctcaggctctaaactataatgccatgggtaacatgtttgctcaacagcagttctttgcaaGa 

ctttccagttaccagcaattagctaccatgcacccatccactagtgctagtcatgccgctga 

ttctgctggaggttatggttcagctcttcccgatatcttcaatcctagcatttctaatcata 

gtcctacttccttgatgaatacttcaaagaaagaagatacaaaagcatttgattttatctcg 

gtaagtttggtccttgtgtacttcaatttttctattattactttgaagatgcattgaattgt 

gacggccctagtgtgtccctgattttgaaggtatgccattaacaattatgccttttgtttat 

ctattttatttggcctaaatccctccttcctccactccaaaaagatgaggtcctccgccttt 

attcttgtggtgataaatgaggtccattgcattgtccttttccaggcccccagtcatgtaaa 

gaataagtggacttggaaatactctggcatcataatcagctattccttttctgttaatgtac 

ttagatatctcattgtagggcttctcagtgcttcatctttttttgtcaatgttgtgagcaat 

aaagtttctcagttctgattgtgtgcaatatcatcttttccaaactgagaagactagaaaac 

ttcatttaggactgtatgactctaattttgttgccatggtggattccctgtgttttttgcag 

gatcatttggctgcagctcgtgatccaaagagggtgatttgagtggttatagctcaaagcaa 

cccagagtatgctgcttataagatttagctatgcacaatttgaagcaggagtaatctgtaaa 

ggttcttttgaagcagtgatatcaatgtgaaatacagtattatttttttttt 
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SEQIDN0118 

ccacgcgtccgaacgattctccctcgtaacttcattttcagtcatggcttctgctactaaga 
aggtattggttccgattgcgaacggaaccgagccgattgaggctatagtgcccatagatatt 
ttgcggagagctggtgcggaggttactgttgcctcagttgagaagcagcttcagattgaagg 
agtgcacggaattaagatcgttgctgatgctctaatttctgattgtgcggatactgaattcg 
accttatctcacttccgggagggattcctggtgcaaccaacctaaggaattgcaagactttg 
gaaagcatagtaaaaaagcaagctgaaaatggacggttttatgctgcaatatgcgctgctcc 
tgctgtagcacttggatcatgggggcttctgaaggggctgaaagcaacatgttatccgtcgt 
atatggaggaactatcatcttataccattgctgttgagtcaagagtccaaaaggatggaaca 
gttgtgacaagtcgaggaccaggaactgcgatcgagtatgctgttgcattggttgaggagtt 
gtatggaaaagagaaggctgatgaagtttctggcccactcgtgatgcgcccaaatcacatcg 
aagaatttgcatttgctgagctcaattcagtaaattggacatttacgagtaagccacagatt 
cttgtacctattgcgaatggttctgaggaaattgaagcagctactattatcgatgtacttcg 
acgagcaaatgctcaagtagtagtggcatctttggaagatacattggagattgtcgcttcca 
gaaaagttaagctagtagcagatgtgctccttgatgaagctgctaagcagtcatatgatctt 
atcgtcctgccgggtggtcttggcggtgcccaagcatttgccaactcagaaaagttggttga 
catgctgaagaagcagagagaatcaagcaaaccgtatggagcaatgtgtgcatctccggctc 
tagtcctagagcaccatgggttgctcaagggtaaaaaggctactgcctttccagctatgtgc 
aataagctctcagatccaagcgaagcagaaaatagggtgttggttgatggcaatcttgttac 
tagcagaggaccaggaactaccatggagtttgcactggccattgcagataagtttattggcc 
gcaaggaagtactagtgctagcaaagaagatggttttctaagtagaatcattttgcctgtct 
tccgtctttaggattatataggcacccatagttacccatagttgtaaacttgtaa'taaactt 
tggccatcagtgtgcacttaaaataaaagaaatgctgtatacgagttacactcagtcgcatt 
tgctaatttcctattcaatgccatcgctttttaaaaaaaaaaaaaaa 

Group 3 

SEQIDN0119 

GNATGNCTGACCNAGNTGNGTGCTTAAGTNTCGCANGCNNCTGTAGTGNAGGGGACCNNNCA 
NTNTCTNCTNGACGNCCGCAGTAACCAGNNCTCTNAACCNATGCATNATGCAGATNCAGGCT 
TTNCAGTCTNTTANGGCTCAATTGGTGTATGCAAGNTCCAGGACATGGTGTACGATCAGATN 
ATGATCTGCAAGCGAAGAATTNTGNTTTNTCAAATGCTCTTTCAAGCCCTGTTCGACGAAGC 
CTGCAGAACTATCAAATTGCTCANGGAGGTTTCCTT 

SEQIDNO120 

ACTCGCGGACGGAGAACAACCGAGAGAANGGGAGACATNGNTCCAAANTCGTGGACTNCCAA 
TGTACTGNTGAGCACTNNGTAACTNATNATNTGGNTNATGAGGGCNNGGCANGAATAGGCAG 
NACGCGGGNGCAAACCCTGCGAATTGATACGATCAGATAAAAGATCATCANNATGGANAGGN 
GCNGNTGTTCTGGGGT 

SEQIDN0121 

TNTGGCTAAGAACTGNNCTATANCTAGNGACANTGTGCTATTCGACTCACAGGAAAAACTAC 
TAAAGGATTCTCTAGATTNGCATAATTTCCAAGAGATAAAAAAGTTTTTCCATGCTCACTTC 
AAGGTGGATCGAGAACTTCAGGCATCTGTTGCTGTGTATGCATTGAAAGGCCAGNGATTCTG 
TTCTTGATGAGCAAAATAGAATCTACAAGTTGGTGT 

SEQIDN0122 

FIGURE 4 (continued) 

BNSDOCID: <WO 030851 15A2_!_> 



WO 03/085115 



74/140 



PCT/EP03/03703 



GGNGAGGNGACGGNGANTGGAGCNGTAGTGTCGCGGGAGGAGGGACAAAAGCTGNANGNAAG 
AACAGCGNNACAAGANTNACACCTNCTGNAATATANT 

SEQIDN0123 

AATAGCAGCGGCAGCATACGACTACGAGAACGATCCGAGATGGGCAGATTACTGGTCCAACA 
TNCTNATTCCTCCTCACATGGCTTGCCGTTCCGACGTNGNNNACCACT 

SEQIDN0124 

TTNAGNATCCGNAAGTTGAGCAACAACTTCAGCACTCGNGCAGAGAATGGAAANTCGATGAC 
ATTGTGNACGCTNGNGGCAGTGGGGGTACGGACGCTGGATTGTCCATTGCATCCAGGCTCAG 
NGGC 

SEQIDN0125 

ACTCCTTACAACCAGCAGATTGCAAATTGCTGCAAGGGAGGAGTGATCAACTCATGGGGNCA 
AGNATACTGCAACATGCTGTTAGCTCATTCCAAGTCAGTGTNGGTGCTGCCGGAACAACCAA 
TAAAACAGTTAGAGTTCCTAAGAACTTCACC 

SEQIDN0126 :: 
GCCAGTCATTCGTTGTCCCATAAGCCCAACAGCCACCCAAAACCATCCAAATCTGAGCTGTT 
NTCCAGCGCTAAGCTAGTGGCCGACGCGGCCAAAGCAAAACTCCATCACGAGCCAAACAGTA 
AGGTCGACAAGTCTGAGCTCGCCGGAGCCGCCGCTGACCTCC : 

SEQIDN0127 

AGAGNCAAATCCCACATGGGGCATCTGGCTTGGGCGGATGCTTTTGTCATCACAGCAGATTC 
ATGTNTAGACATGTTGAGTGAGGCTTGCAGTACTGGGAAGCCTGTGTATGTAGNTGGAGCTG 
AACGCTGTACGTGGAAGCTCACAGATTTCCACAAGACACTCAGAGAGAGGGGACTGACTAGG 
CCATTCACAGGACTTGAGGATATGTCAGAAAGTTGGAGTTACCCTCCGC 

SEQIDN0128 

CAACAAGAGGAAANTGGAGTTCAATTGACTTGGGAACTGAAATTTATAGAGACGCTAATCAA 
GTGGCTGAGTGGACTGTCTCTGATTTTGACATTCTTGTACCCAATAACAATTAGAAGTA 

SEQIDN0129 

CTTTTGCTAGAATCTTGCAAGCTGGACAAGGTGAAATGGTCGGGAGTGAAAACACAGCATTC 
AACAACCCCGTTCGTTGATGAAATGTACGAGCGCCTGAAAGAAACTCTAACTGACTATGAGG 
TCATCATCTGCCGTTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCTGATATTGAG 
AAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDNO130 

CAGAAAAGGAGGAAAAAATGAGAAAATATCTTCTGCTTAGAGTGTTGTCAAAGCTTTTGCCC 
TCACTGCCTTCCTTCTCATCATTTTTGTCCTCTTCTCTTGGTCTCTCTCTGTATAATTATGT 
AGTAGATAAAACTTCAAGTATTCATTTGAGGTTTTTGTTTCCTAA 

SEQIDN0131 

CTCTNCCTCACACGAANNAGTACTTGACAAGGGAGTTAGTACTTTATANNGACGACANTTTA 
GNCN 

SEQIDN0132 

NTNCCATGTTNANAAAATNCAAGCTCTGAATGGAAACGGCTTGGGTGCTGATACTTCTTCCT 
TTGGTTTCTTGGGACAGATTCCTCGAAACTTCAGNTTGTCGGAC 
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SEQIDN0133 

GTAGNATTCTCCATTTGAAAATACATAGTGTCAGGCCATCTGGGACATCAAACGGAGGTGGA 
GAGACTAGTCTAAGGGCAAGGAGACCACCAAGCCAAGATCAGGA'TGCTGCATTAGCATTGCG 
ATTGCAGTATAGGAATTCTCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAG 
CCA 

SEQIDN0134 

NNNNGNAT.NTNTCTGNTATAATCTTGCAAGNTGNACAAGGTGAAATGGTCGGGAGTGAAAAC 
ACAGCATTCANCAACCCCGTTCGTTGATGAAATGNACGAGCGCCTGAAAGAAACTCTAACTG 
ACTATGAGGTCATCATCTGCCGNTGGCCGGAGTACACATTTGCATTGGAGAATGCCATTGCT 
GATATTGAGAAAGCAATTTTGGATGCACTAGAGAAGCAATATGCAGATGTCTTGTCACCA 

SEQIDN0135 

TCTGGGTATCATTTGGGCTGTTCGGTACAAGACGGATACGGAGAGTCACTTANGAGAAACTG 
TTNGGAAGAGAGAGAAAGAGGCGGGAAACTATTGGCNAGGTGTGTGGAGGAATTGAAGAGAA 
AAGGGGTGGAGTTTGATTTGTTGAAAGAGGTTGNCGCTCTTAGGAGGGCTANNAGTTNGAGG 
GTTGAAACTAAGGTTG 

SEQIDN0136 

GCACGTTTGGCTCGTCTGCTCTCCCGCAAGAAAGTGCGAGTGATATGGATATCAGTAGTTCA 
GTACTTTTGTTAGGGTCAGCTTGTTGGGGAATCCGGTTCTTTTGTTATTAGGTGGTAAAAGA 
AACTTTTATGTCGCTG 

SEQIDN0137 

TTANGNGCCAATGTTTCAATACACATTGCCCCCGCCATGAATATCGGAACAATGACAC7\ATT 
ATTTGATGTAGCACAAGAAGAGTGTTCAGNCCTTTTCTTGTGGACTTATTTGGTTGCAGCAT 
TTGCACTTACTATTTGGTCCACTGTATTCATGTGGCTCTTGTCCTGATTGTCACAAGAAGAC 
ACAATATGAAGATATTATATAATGGTGTGGTGTGCT 

SEQIDN0138 

TGAACTGAATNTGGTATCTGTATTACTCCTGTTGTAATGGCATTGGACTTATACGGCCTTGG 
SEQIDN0139 

TGTTTGTCTTCCGGCTTTGAGGGCTTTCCAGATGGTACATATGAAGGAGTTGAAACTGGATA 
TGACGAATTATTTAGGACGCCATCATCCCTCACGGAGAGCCCAGACATTCTCTGG 

SEQIDNO140 

TTGAANNCCCNTTNNGANGCACCACAAGNTNNAATCCTTNCTGTAAATGGTAGCAAAATCCT 
ACCCGATTGGGGATACGGNAGAGTTTATACTGATTTAGTTATCAATTGCACTTTCCCTATTC 
CAGTTGGNACTGAAAATGGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNGGCGGNGAC 
ACTAAATTNAACACCGCCGACACTTTTNTAGGG 

SEQIDN0141 

CANAATACCCCTTNAAACGACCCGAGTCTCAAAATCGANGAGATTGGAACACTTTCGGTCAG 
TACTTGAGGAATCAAAGGCCACCAGTTTCCATATCCCAATGTTATAGCAACCATGTCCTAGA 
GTTCCTCCGATACCTGGACCAGTTCGGGAAAACTAAGGTTCACTTACAAGGTTGTATCTTTT 
ATGGACAGCCCGAGCCGCCAGCTCCCTGTACTTGTCCG 
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SEQIDN0142 

CTNGTGGNANTAAAT7^CATTCTCATTTCTTTCCNNTNTTNTTCAGGTCCAGTACCACCAAT 

GGAGNCTTTCCCTATNAN 

TAATTCATATAGTCG 

SEQIDN014 3 

GNGATGTCTNCNTATTGCACATCTGNTGTTGATTCACTTTATGGGAATGCTCAGAAATTTCA 
AGCAATCGAGACAGATAATCGCAGGCAACGAGCAGCTCTAGTGACCTTACAGGAAAAGGTAG 
ATGCTGTTGCTTACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCG 

SEQIDN0144 

GAATGGA7y\AAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGGCTACTT 
GTAACCACCCTGCTAGTATTTGATGAACTGCTGAAGTCTATAAGGCCGTATGGCACTCTTAG 
TTTGTTGTCCGCGCTGATGTGTGAAGTCA 

SEQIDN0145 

GTTCCTTTGCCATTTCAGCTGCCGGCGGCAGTATCTTCACCTGGGACGACGTCTTTCAACTC 
CCCGAATCTCCTCAAAATGACTCTTCTACCCTCTCAGCTTTCTTCGATAAAATCAAGCTCTG 
TAATCGCAATTCGGAGAAGCAATCCGAGTTCATGCCTTTCGTGATTGAAGACCAAATAATNG 
GAT AC G T AC AC C AC GT G G T T G C T G A 

SEQIDN0146 

AGANTTTAACTCNATCCATTACTGTANATGGNATGCAAAATCCTACCCGATTGGGGATACGG 
NAGAGTTNATACTGNTTTATGCTATCAATTGCACTTTNCCTATTCCAGNNGGNACTGANAAT 
GGAGGAAAACTCGTANTTCATGCCGCTACTAACGGNNGCGGAGACACTAAATTCAACACCGC 
CGACACTTTCGTAGGGTTACTCAGGACTCATNAAGAAGG 

SEQIDN0147 

TTTNCACACTAATTCCCCTNTATCTACNAATGAAGTGCGTGGGCTTGCAGTACCTGGAGGCC 
ATTCGT7y\GCTCAAGGCTTCTGGCTTCCAACCAACGCGCACTGTCTACCTTTCCTTCGTCCC 
CGACGAGGAAATCGGCGGNGNCGATGGAGCCGGAAAGTTTGTCGATTCCGATGTCTTCGTGA 
AGATGAATGTTGGGATTGTACTTGACGAGGGCTTGCCTTCTCCCACCGAAAACTATCGTGCA 
TTCTATGGGGAGAGGTCCCCCTGGTGGCTGGTCG 

SEQIDN0148 

GNTCCGTAAAGTCCCCAG1SINTNCNCGACCCGTNACTCNGGAGTTACAGCGANACANGTGGCT 
GNATNATNNGACATACTCAGACCTANTTAGCTTTGATATAATCCGTGAGGGTAANTTCGTTC 
TTTGCAANCAAATGGACGAACCTGGTATGTTTAGCCTAATTGCAAACAGGTTTGCTGATGCT 
TTTATTTCATGGGTTTCAATTTGTACTAAAGCTCACTTGCCGTTCTTCATGTACTAGAAAAC 
TACATATGTCTATGACCCTTTACCTAGTCTGGTAATTTCAAGGCATGAGATTGNGATTGATC 
AAAACAAGTTGGAG 

SEQIDN014 9 

GGTAACTCCGATATCATCGAGAGCCGATACGCACTCACAAAGCGGCAAGGTGCTCGCTATGT 
GCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTAAAGTCCAGGGANGAATTTGTCCAAC 
TATTTGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCCGAAG 
AGGTCAAAAGCAGAGATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCGAAGT 
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GCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTACAGGN 
TTCTGCAAGAGAAGTTTGAGCTTNAGGC 

SEQIDNO150 

TTTTNNCACTTCTAAAACCCTCGTNTGANNCTGCNAGGCATGTGAAGNTGTCAAACTCAAAC 
CTTATGCCANAAAGTGCAAGAAACTGATCTTTGAATATGCGCCAGTGATTCTCGTAAATGCT 
GAACAGTTTCTGGAAAAA7\ATGACGTATGTGCTATTCTTCATGATTGAGAGCCTGCAGCAGA 
TAAAGAGCTACAAGCATCACCAAAGATGCTVAGCTTCATTGCATTCGGCCTC 

SEQIDN0151 

CTGNCCCTATCCGATCCAATAGTTGACTCAAAGGTGTTGCCTATTCCAGCCGGAGATTTGAG 
TTTTGGTTCGGGTGCACAACTGAAAAANTCAGTTGGNAATTGGTCAAGATGTCTTACTGATT 
TGTTTGGCATAGATGCTGAAGATTCCGGNC7\AAATGATGAAGGCAGCTTCGGAGATGATCAN 
AGGAAAGGTGGAAATCAACCAGAGCATTTCCATCTTCTCAATGCC 

SEQIDN0152 

AATCCTGTTCAATACCAAAATCAAGCAACAAACGGTTGGCTVAGCTTCTTGGAAAAAGTTTTT 
GATAGATATGATGTAC 

SEQIDN0153 

TCAACTGAGAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGCAGAAAGCCTTA 
TGGGAGTAGCTCCGAATGGTTTATGGATCAAGACACGTCTCGCTGGTATGTCTGGGATGACA 
TGGCACAGGCCTTTGTCAAACGGTTCCAATACAACATCGACATTGCCCAGACCACATTTCCC 
T T T C AAACCT GAAGAAGAAAC CAAGT GAAAGTT TCAGGGAAT AT GCC A 

SEQIDN0154 

CTGGGTGAA7\AAGCTCTCCTTNTGCCTTNCCAGAAGCACCTAGCGCAACATGTAATGGATAG 
ATGTGCTCGGNCCATGGATGTGCAACTTTTGCATGCGGAGCCTTCATGTCATAGTTA 

SEQIDN0155 

CGCTGCTGTGNGTTG7\ATTTTCTCCCATTTTTTGGAGAGGTGTGTATCTGGA 
SEQIDN0156 

AATGGAGAATGGAAAAAGTGAAAGTGAATTGCTTGACGGATGCCGAGTGCTACTATACATGG 
CTACTGTAACCACCCTGC 

TAGTATTTGATGAACTGCTGAAGTCTATANGGCCGTATGGCACTCTTAGTTTGTTGTCCGNG 
CTGATGTGTGAAGTNA 

SEQIDN0157 

GNTATGTTGCTGATCAATCTNGTTATGGCATGGTTGATCCTTCTCAGCATTATTATCCGGAG 
CAACCATCCAAGCCGCAGCCAAGCATTTCGAACAGTCCTTATGCTGAGAATTATCAACAGCC 
ATTTGGTTCTTCATACAGTAGCGGCT 

SEQIDN0158 

TACCCAAAAATAAAAGTACCATCCTGATGCATCCTAATGTGCTACATATTGCAATCTTCATG 
GGTAAAAGAGGTCATTTGGCGGACCAATGAGGT 

SEQIDN0159 

ACGGGGCCTCNAGGCTAATAAACAAACAGAAAATGAAAATTCTTTTGAGAAAGAGTTGCTAA 
AAATGCAAGAAAAACTTCAAAAGATGACACTTGAGAAGGAGCAGACTGAGGAAATGTTGAAA 
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GCTAGAGAGGATATGTTGAAGCAGAAGGAGGAAGAGCTCGAAGCTCGGGATAAAGAGCANGA 
AAAGCTTCAAATTGAACTCAAAAAGTNGCAGAANATGAAAGAGT 

SEQIDNO160 

TTTTNNGANGTACTAANNCNCATTNTAGCCGACCGCACTCACAAAGCGGCAAGGCGCTCGCT 
ATGNGCCTGCTGCTTTCTTGACTGGTTTGCTTGACCCGGTAAAGTCCAGGGAAGAATTTGTC 
CAACTATTNGCTGAGTTAGAGGGTAGGATACCAGTTCTAGTTCTGGCAACAGCAGGTTCTCC 
GAAGAGGTCAAAAGCAGANATGGAAGCACTTATGGAGGCCAAAGGGGTGAGCAAGTATATCG 
AAGTGCCAGGTGCTCTCCTTCCCCAGGAAGAGTATCCTGAAATAGTTGCAGAACAGCTTTAC 
AGGTTTCTGCAAGAGAAGTTNGAGCTTNAGGC 

SEQIDN0161 

TAATCNCACNAATNGAGGCCCTATGCAAATCTCNTTCAAGTGGAGTTTGCTTCATACTTGCA 
TTATTGTCACTNT 

SEQIDN0162 

TTATTGGGTAATTCCCATCTACTGGGTCTTTCTCAAGATTTNTCAACTGCATCTGTAGCCAG 
T G AT AAC AAG C AG GAT G 

SEQIDN0163 

AAGTAAACTCNCGGTAGGGAAACTACNNCGATGAAGGTCTTCAGTCAGCTGAACTACTTGGA 
TGTGCTCATATCCGTTNGAATGAGCTTGAGCCTGGTAAAGTAAAGGNTATTTGG 

SEQIDN0164 

GNCGCGANTTCCTTCGTCCAAGACTGANGCTTTNTANTTCAAACAGGTAGTTCAAATGCTTA 
CTGGGTCCTCTGAAACCGCCAAGGTTGCAGCTACTCCGGGTCGGGCTGAGCCNGTTAGACAT 
CNTATCCCGCCCA 

SEQIDN0165 

TGCTGTGANTCTTTTGCTACATATGCCTACGATTTACGAATNTGCAAATAATGTTGGATTTT 
CCAGGCATCTTCAGAATTGCCTATGGTGCAAATTGCTTTTCAGTATACTGTTGTTGNCCCAC 
CAGATGAACTTGCAAATGCAGGATCAAGTTCTACAACAAGAACAAAGCATTCCCTCAAAAGG 
AGA 

SEQIDN0166 

GCACNTGTCGAAAATCAGGATTGATGTCAATGCTGATCAGCACCCCTTTCAGTACAAAANCT 
AAATCAACCACAGAAGCCAGCTAAGGTGGACNTGAANTCCGCAGTTTATCCTGGCGGTCCAC 
CTTCACCGGCAAGGGCGCCAAAGATGTCGCACTTTGTCGATACAACAGAAATGGTAAGAGGA 
CCTGAGGAGTCACCTGGCTACTGGGTGGTAACTGGTGCAAAGCTATGTGTAGAANATNGTAG 
GATAAGAATGAAAGTGAAGTACTCGCTC 

SEQIDN0167 

CCTTTTNAGGCCACGNNTNGGAGCAGCAAACACAGCAAATNTAGCGATGAGANAGCCAGTAT 
TCATAAAAGTGGATCAGTTGAAACCGGGAACAAGTGGTCACAATCTGACGG 

SEQIDN0168 

TTTTNCTTTGGATACATGGCTTGCATCTGCTATGGTTTCTTCCTCATGCTAGGGACGTGTTG 
GTTTCCGCTCATCCATGCTCTTTGTTCGTCACATTTATCGATCTATCAAGTGCGAGTAAACA 
TTTTGTGTAGTATTTGTTTCCTCCACTTTAGCCTCTCTACTTCTTCGGGGTAGATGAAAAGT 
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SEQIDN0169 

CTCCTACTCCTCAGTGTTTCTCAGCCAGCCGTGGAACTACAAAGGCCACTCCATCTAAGGCA 
AAGTATAGACCTCTGGAGACAAGGGGTATCCTTCAAGAACTGGAACAGAGCAGCAATGAAGA 
GAAGAGAAAGGAAGATCAAGGGAAGATGATGAGTAATAATCAACAAGGACAGAGAGGTGGTG 
CTATTGTTGCTGAAAAAGAAGCTGCTGCTAGAGCTTTGGATGTCTTCTGGTTCTTGAAACCT 
TGCACTCTTTCCAGCTGAAATGGTCAAAGCCCACTGCTGCAGAACATTTCATGAAGTGATTC 

TTTCATAC 
SEQIDNO17 0 

GTTCCCTCCTACCAAACTTGAGGAAATCAAGTCTATGCACAGCCCACAGTTAGCACAAAGGG 
CTTACAGCCAAGAGTCAATGTACTAAGAGGAGAAAGGAGCCCCAGATTGACGGGCAAGGGAC 
TTGAAATAAAGCAAACTCCTAGCCCCCAGCCATCTAATCTGGGTCAAAATGGTCGTGGTCCG 

TCTTCTACCTAGTA 
SEQIDN0171 

NTCNAGANTGATCGANCAGANGGTGCNGATGATACTTTGGNAAGGCCTAGTGAAGAGGNCAA " 

CTCCAGATAATGAATCANAGTTTCAGGTGGAACAAGAGAGAGAGCANNTAGCNGCGGACGAA 

AGGGAAGAGGGAGAGCTAATTGCTGATCCTGAAGATGTTGGAAATNTCGAGGGAGTNAGCAA 

TTTA • 
SEQIDN0172 

TAAGGNCANNCNAAGNCCANCAGTGCCATNACGCNNATTGCCTGACTGTTCANTGCCTACAN 
TNTGCNGTANTTCTAATGGCGANCTGAAAATGGCCAAGGNCCCCNAAACCTAGAGCTNTGTC 
AGTAGANTNGGGTNTATATTTGAATTNGATNCTGTTGAGTGATAANGATGGTGGACNCNTTG 
TACCTNTACCTGANTGCAAATAANGTNTTGTCATCAACAGANNTTATGCTA 

SEQIDN0173 

GTTAAGNGNGGCNANGAGGAGGCTGTTTCNATGCAGNNNGTCTGGNCTATCNNGTNNTNTNT 
AGGNNNANATCCTANGNCTCACCTGGNTCTCTTTAACCCTGAGNATCATTNCACCACTTTNA 
CTCAATNTNCTCAGNCCCANCTNTTCCTNTCAAAATTCGAAATTATTGTNCCCATAGTATAT 
ACTCTGTTTCTGGTCCTCCTTTTCCTCTGTGCTGTAGCCACAATTACATACAGCACGCTTCA 
TGTATCCTATGGTAGACCTATCAACCTCGTTTCCTCTA 

SEQIDN017 4 

GTCGNGCAAAAGAAGTTGTGGCTCACAAGTGGAAGCATCAGAGATACAGAATAGACAGTGNA 
GTTTGAACACTTNTTCCTGATTTATTTTCTCTCTGCCTTTAGGGA 

SEQIDN0175 

ATGAGAAGCCAAGGAGTCCCTAGTCTTNCGNGCTTGTTACCTGGGGCCCAAAGGAGCAACTA 
GTGACTCGNA 

SEQIDN017 6 

NNTCATCACCTATGCCACTACTCTTCTTCTGGAGAGGCGTGGGAAAGAGATTGTCTTGAAAG 
CANTGGGCACAGGCAATTAGCAAAACAGTTGCTATAGCAGANATCA 

SEQIDN0177 

FIGURE 4 (continued) 

BNSDOCID: <WO 030851 1 5A2_I_> 



WO 03/085115 



80/140 



PCT/EP03/03703 



TTCGGCCAAGCTGACGCTTCCTCTTTCAAACAGGTAGATCAAATGCTTACTGGGNCCTCTGA 
AACCGCCAAGGTTGCAGNTANTCCGGGTCGNGCTGAGCCNGTTAGACATCCTATCCCGCCCA 



SEQIDN0178 

CGTNGNCCAACTCANGGCTGAACAAGTGATGAAGNGCCCTTCACGGGTTCCTCTGNAAGAAC 
CAGNGGCAGTTGGTGGTAAACATATGTCAAAGTCTNCAAGTATGAANGGAATCATCACCCCT 
GCGCC7\AGGTTGAGNTTCTCCCCTTCCTTACCTATCACCCGAGCATCGGNTTCTNCCTCAAA 
GNCTTCTACGCAGCCCTCGTCTCGTCCTTCA 

SEQIDN0179 

CGNGTGG7\AATCGCCCGTAACTGACCTGACATTTCCGGAGTTTACTGAGGAAGAGTC7U\CGT 
GGGACATGTGTTCGGATAA 

SEQIDNO180 

NTACTGGTACTNGAGCGGGGGAATTTTTCGATTATCTGCTGGGCTGATTCCATTCAGTAATG 
CTTTCCGGGAATGTATAATAAAATGCCGAGTTGGTGACTGAGAAGAAACCTTGTAAATAAAA 
TCACATAGTTCGTTGNANGAGTCGTGGTCATCAACTCCAATTCTGCATTTCACNCTCACTGG 
AACATTTGTATTTGCAGCAATTACNCACATGGCCTCAGCAACAAACTTTGGATCAAGCATGA 
GTCGCACACCAAAACACCCATGTCCAGCTACTTTAGGGCTAGGACATCCGCAA 

SEQIDN0181 

AGTGCCCATTTTNTCAGGTTGNANAATGAGCACCTTGNAATAACGTCGGGAAATATCGTGCC 
TTCAAGTCCGTACCATTGCCTT 

SEQIDN0182 

ATTCCCGGTTTAACCTCCAGTATCCTGTTTTNCTGATGAAGACATGCTAGAGGTCCCAACAT 
ATGCTTTAG7U\GGCTAGAAACTTGTGAAGATAGCAGATGGTCATAATTGTAAAACTTGGGTG 
TCATGAAAATATACGTATCACGACAATGACTGGTGGCTGCAAAGTTGAATGTGTTGCTGATG 
GTGACTGTATAGTTAGTTGAATGTGTTGCTGATGGTGGCAAGAGAGGGTTTATACTTTTGGT 
TGTGTT 

SEQIDN0183 

CCNNNGNAAATCCCATACANATNTTGCNCTAAACTTNCTCACCGAAACAAACCTGGATGTTC 
TCTGNGAACTGGNTGATNTNC 

SEQIDN0184 

CNTTTGGAACCNCCCTCAAGCTGAAAAAGAGGANGGGATTATACCTCTTACCGACACATTGC 
CAATTATGCCACAGGNTTTATGTCACCATGCCC 

SEQIDN0185 

NAGCGCTTGGTCTTNCCTCACTGCTTNTGGTGNTNAATCTTGGNNTNTGTNCCNAGNCGTCN 
TCNNTT^ACATGAAGCNTANGGTNANAAAGGNAGAAAAGTNTNCTGCTNGAAGGGCTATCTTG 
CCCNTGCTTCANGCTGAAGAGGATGAAAGATTCGT 

SEQIDN0186 

GCTTAGANNTCTTGAAGGTGACCTGATAATGGAATCTGGCCAATTGNCTACAACACCTAGGT 
ATGATGTGGGAAGCCAAAGTGGAAGGATTTTGTCTGATCACTACATNCAGCNTCATAGGTAC 
AGNGNCTCNATACTAAANGATGGGTTGGAGGGA 
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SEQIDN0187 

CTTGGATGGTCNACCAGATTGAAGAACNCGAGAAAAAGCTGTTTTCTCATCCACTTCATAAG 
T CACAAAAT GAACANCAGC 

SEQIDN0188 

GANAGCCCATGCTGNTNTANNANAAGCTGCNGAGGCTGACTTGGNGGNNCTGTACNNAGGAT 
ACCAANNTGTTNTTGCTNATNCACGCTANAAGGGNGACTATTANGCCTAAGGATATTCAGNT 
GGCNAGGCGTATTACGGGAGAAAGGGC 

SEQIDN0189 

CGTTAGTACGCNCATCTGATAATGACNTTGAGAGTGCAAAATGCATGAGCTTGTTGTCAAAT 
AATCATCATCATTTCAGTACAAGGCCAAAGGCTGATTTTACAAATCAGGTCATCCGCAAAAT 
GAGGGACTTACCAGCTGCAAAGTTATCTCAGTTGC 

SEQIDNO190 

ACCTTGGAATTNCTTACTGAATGACTAAATGACTTGCTCGAAGGACGAGGTGGTGGCATTAG 
AGCACGGGCAGATATTTTGCCCCTTCTTGATTTTTCATCGCTGGACATGCTAGAAGCTGCTT 
TCACTGATGTACTAGAAGCTTGTTCTCTTGCACTTTTATCAGTCAACTCGTCATCAGCATCA 
TCACTTGAGCTTCCACTTTCAAAGAAGTCATCGCTGC 

SEQIDN0191 

ACACCNATTTCAACTACCNNGAGCTGACAGCNTACNCAATACACCGCAGANGCTTTTTGCTT 
ACTCCAAAGTTGCTCCCTAATCTGGAATACAGCGAAGCATGCAGCATTTTGACTGTAATGAA 
TGGTCCTTGGATTGAGCAGCCATCCAAAGCTTGGAGTGGTGGAGAATGCTGGAGGTCTGTGA 
TGGNACTAATGGNGGAACGTTGTCGCTGACTAGAAGCAGAAATACTTTAGGTTGATTCGAAC 
AGGATTTGTCCATAAGAAAAATTTGCTTCCTTGTTGATCTGCCATTGCGCTAGTTACAAGCT 
GAATCATGGTCGCTCANCTATGTTTTTTGAAAAATCTGTTATTACTGGCCCTTGTTCTAAAA 
TAACATAATTCTTTTGTACGC 

SEQIDN0192 

CGTTTGNNTCACGTTCTAATGTACGNTNACTTCATTGGAAACANTCCTACACATTCAGAGAC 
GAAACTGGCANACTCTTACCTCTATGACAAAGCTACATGNATTCTTGCTNGGAAACTCTTCC 
TCCCGGAAACAGATTTCAATCTGGACCATCTAGCTGCAAATCCTCTTGTACCAGAAAAAGAT 
ANACTCTTGGAC 

SEQIDN0193 

CCTTGGAATTGCCTCCATTGNTGCGGGCCAGGCGTTCCTGATGTATGGAACGAGTTGCTATT 
TCAATTTTTGTGANAAGAACAGGTGCACTTTGTAAGTAATCTTTNCACTATCATTGGAGAAA 

AGAAAAGTTCACACCTTGAACGTAATGTACATCTCGAATGAGCACGCCCATGTTTCTACTGT 
TAGTA 

SEQIDN0194 

CCTTTCTAGTTTGCATCACNTGCATTTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTC 
TAGGACANGTGTACCCAAA 

SEQIDN0195 ' 

CCTTNNACATTTTCTGGTTAGCCTCTGGTTTGTTTTTGATGTTTTTAGCACCGGTGTGCATA 
ATCCAGTGTGC 
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SEQIDN0196 

GTNNGNGAGCNTGGCNNAGGATGCAGACTACCAAAGCTCNNAANNAAGCTTCTNTGAACACT 
CTCNTAATAGGTNAGATGTAATGGTCTTTCAGAATGGCCAACACAGCAATANTGCCTTGAGT 
GTCCCAAATCACTGATGCCATGATGTGGGCTCCTAGACTGNCCTGACTNTNGCTTTGACTGT 
GCCANTGCCACCTCNTGGTAGCCNTNAGTTTTCATGATGCTTTGCCTTGGAGATCATATTGN 
CGAAGCCATGTTCCAACTGCCGCTTACAATTTGTCAGAGGGATNCGTCAGGATCGNGAACCC 
TCCTGT 

SEQIDN0197 

TGCGGANCCNGGGCATGNCAGCACGCCNNAGGACCATGCNAGCAGNATNACTGCTNCAGNAG 
ATNGNNATGANGNGNNTGNCNNNATGTTTGTTAGTCCNGCGTGTCTTTTAATANATCATCNN 
TNACCGCNTAGGTTNGNNCANCACTGNCGAGGCTNTTANGTNANNTAAGAGTCTGTC1SITNGT 
CTAG 

SEQIDN0198 

AGTGCAATGAGTATAGCTATTGAGGCTATTGAATCTGGCGCTGTCAATGCTX3CTTCTGTCCT 
TGAGCAAATTGAGCTCCAAATAGCTCAAGCTAAAGAGGAAGCTTTTAGCAGAAAAGACATTC 
TAGACAAAGTCGAGAAATGGATTGCTGCTTGTGAGGAGGAGTGTTGGCTTGAGGAGTATAAC 
AGGGATGAAAATCGCTATAATGCTGGACGAGGCACCCACCTTACCCTGAAGCGTGCTGAGAA 
AGCTC 

SEQIDN0199 

CAAAGTACGANAGCGATGGNTCTATCTCTCAGCTGCAGGCAATGCCAAGTCGCCTTGACTTC 
ACCACTGAATTCCTCTCTCTAGCTGCTCATGAAGCTATTGTCTGTCGTTGTCATCCAGTTAC 
TGTTGCTTCTCTGTCACTTCTCTTGAACTTCTATCCCACAGGGAAACAGATGCCAACAACTG 
AGGTTGTAGTTTTCAGGACTTTAGTCACGACTCTATCTCAGGGTCCTCAGAATGATTCTGAT 
ATCCTAAAGCAAATGAAACGAGCTCACACTCGGCTATCTGAGCTAGGTGCTGACAAATTTTT 
TGGGT^lAAGGTGAGATTGGGAGACGGGAAAGGAACTGGTTTTCAGTGAATGCATGGAATTCCG 
GTG 

SEQIDNO2 0 0 

CTGGGCNGACATNGCCTANTGNGGNNTTTCTGAGGNNTCCNATNGACATGATTGTGGGGACT 
CCAGGCAGGGTTCTACAACATATTGAAGAGGGAAACGTGGTTTATGGTGACATCAGATACTT 
GGTCTTGGATGAGGCTGATACCATGTTTGATCGCGGTTTTGGTCCTGATATACGAAAATTTC 
TTGCACCNCTGAAAAACCGTGCTTCGAAGCCTGGTGATG7VAGGATTTCAAACNGTGTTGGTG 
ACAGCAACAATGACAAAGGCAGTTCAAAAGCTGGTTGACGAGGAGTTTCAAGGGATTCAGCA 
TTTACGTACTTCTACATTACATAAGAAGATTGCTTCTGCTCGTCATGATTTCATCAAACTTT 
CAGGTTCTGAGAACAAGCTGGAGGCGTTGCTACAGGTTCTTGAGCCAAGNTTAGCAAAGGGC 
AATAGAGTGATGGTATTCTGTAACACGTTGAATTCCAGTCGTGCTGTGGATCACTTTCTCAG 
TGAAAACCAATTTTCTACTG 

SEQIDNO201 

TTTNNCAACCTTTGGTATTGNGCTCACTTTTTNCTATGGCAGNCTTGGCTTCTTGGTGGCTG 
GCAAGAACGGAAAGAATGGTCAANNCTTCATCAACCATGCAATTGCTTGAATCTGTCAGCAT 
CG 
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SEQIDNO202 

ANGTTGGCC7\AGGCACTTGCGAAATTCTTTGGTGCCAGGCTACTGATAGTTGATTCTCTCTT 
ATTACCTGGNGGATCAACTGCCAAAGACATTGACTCTGTAAAGGAAAGTTCTAAACCTGAGA 
GAGCAAGTACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO203 

CACTTGCTACTCTAGCAACTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATA 
TGATGAGTCTAAATTAGGAGTGAGGCATTCTCAAATTCATTGCTCAGGGAGCAGAAGTTGAT * 
ATGTGGATTGCTACTATTTGCAAGAGCACTTTGCGGGCATGTTAGGCAAAGTCATGTTTTTT 
TGTTCCTGATCAGCATTCTTCACTATCTGCCCTTTGAAACAGTTAGCCATAC 

SEQIDNO204 

NGACCGCNGCGATNCTAGAATCAGTTGANANTTGNNGNTNGGACATGGNATNTCTNGCNCNT 
GAAGCNTTTTTGTCATCGACGATNATGAATTTCTACATCCGGTNCCTCCAG7y\GACAAAAGC 
TCATGCAGANGTAGATGAGTATCACTTTTTCAATNCATATTTCTACAAGAAGCTCNAAGAGG 
CTGTACTGAGCAAGAAAGGAATNGCANATGCTTTGNTGGAATATCAACTNCCAACAGCTACC 
TGAAAGCAAAGTGCC 

SEQIDNO205 

TCAACCAAGNTGTTCCTATTGGTTCAAGCTTCTTCTTCAATCAACTTGCCTTTGCATTTTCT 
TCCAAAGAGGNATTTTAGGTACAGNAAT^AAGATAGTGCCTTTGAAATTGCGGTTTAGTTGTA 
AAAGCAAAAATCTTGAAAGAGAAGCATGTGCTGATGATACTC7y\ACTGCTAAAGCAATCACA 
TCGCATAGTTCTAAACTCGAAGACGTTATCTGGTTTCAGTGTCGGCATATTATCAAGGGCTT 
GGCTTCCAGGATTTCTCCAACTG7U\GAGTGAGGATGGCTTCCTTCTTCC 

SEQIDNO206 

NCAATGACAGTGCTTGCTCCAGCTTCCCTTACTTCCCCACCTTCTGTGGTGTCATTGAGCAC 
TTCACCAACATCACCAATGAGTCCCTTCATTGGTTCTTCTGATTTCACAGAGAGAGTGAGTA 
TCGATAAGCAAATAACTGCTGCTCAGAGCAATAGCTTGGTATCAG 

SEQIDNO207 

TTGCTTCCTTGATAGNGCNTGACAGANGCCTNANGACCNTCCANTCAGGATNACTCATTCNG 
GAGGTTGCNTGAGGAGATTTTNTTTATGTTTTTAGACTGCTGCNCTTTTTTATATCATCNTT 
TACCGACTAGCTTTGNACAGANNNGNCNAGGCTTTNAGGGGANGGTAGAGTTTGTCATAGTC 
TAG 

SEQIDNO208 

AGATGATAAAAGTCTGGACGAGGNTGGGGACCCGACGCTGCCCATCTTAGAGGACGGCGTAC 
CAACTGAGAATAAGAATATAACTAACTCACACCTTNCTCTGCAAATTCCAGACCTC 

SEQIDNO209 

CNNNNCACTCNNGAAATACTTTNNCGCCNGGCTACTGATANTTGATTCTCTCTTATTACCNG 
NTNTTTCAACTGCCAAAGACATTGACTCTGNAAAGGAAAGTTCTAAACCTGAGAGAGCAAGT 
ACTTTCGCTAAACGTGCTGCTCAGGTGGCTGCACTACATC 

SEQIDNO210 

TTTTGANGGCCTTAAGCTACATCNGAGGAATGAAGAGTATTATCGGTTGATTTCACCACACA 
TATAATGCGCACTGGACCTACATAATCCTGGAAACTGTAATGTCAAGGNGGTTGAAGCGGAA 
TCTTCACAATCATTCAAAGGAAGAATCACAATGCTGCCTATCAGTTTTNATGATGC 
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SEQIDN0211 

CGCGTNNGTCTTAACGGCTGGTCGGCTGGCATANCGGTNATACGGNTTATTNTGCCAGTAAG 
TTTGGCCTCAAAGGACTGGCAGAAGCATTGCAGCAGGAGGTTATTGGCGAAAATATTCACGT 
ATCACTAATATTTCCCCCGGACACTGAAACTCCTGGATTTGCTGAAGAGAACAAAAGAAGGC 
CACGGGTGACTAGTATAATAGCAGCCTCTTCTGGTGCCATGAAAGCTGACGAAGTTGCCAAG 

ATAGCTTTGAATGGCA 
SEQIDN0212 

GGCANTTNTTTTCTATTACTTCCCAGCCTTGGGTGGAATGGAGTATGTCTTACATCACGGGC 
TCTCNATGTTTGCAATTGTTCAATCCCTT 

SEQIDN0213 

GNTGGAATGCCNAAAGAAGCANNCGACCCCTGTCCTATCAGTGTCTATTCTTCAGTTTGCTA 
GATGAGAAAAGATCATATTTTGTGCAAAGCAATCAACCAAACTTAGTTGCTGTTGCTGTAGT 

SEQIDN0214 

CTTCTTATCTTCTACAAATTGGCTCAATATTTTTAGTCTTACNATTTTATCTTTTTNTAATT 
T T N AT AAAG AN AT AT AAT T N ATT TG AGN G A 

SEQIDN0215 

TTCNGTTACTGGGGNTGATGATTTGACAAACCCGAAGTGGTATGTGGTGTGGTCTGCAAATA 
TGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTACAAATATGGACGTCATATGTCAGGT 
CAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGCTTCAAATGCAATGGGTACA 

SEQIDN0216 

GNAGCAATTATCCTTCCTTTTCTCTTTTCAATTATTTTTCGTAAGGGTAGTTGCTGATAGGT 
TTGGAGGTCCAATGGCGATTGGCATGAAGCAGATGTCCATAATAATAGCAACCCTTGGTGTT 
TTATCCTTTGTATTTGGAGTTATTGCTGAAAACAAGAAGCCTGCANCTGGGACTGCAATACC 
AGGAAAAGGCGTTGTTATTTGTAAATACAAGTCTGACCCTACTGTTGCCTTGGGCTATTTGT 
CTTTTGCTCTTCTTGTTGCATCTTCTGTGGCCGGTTTCCTGTCGTTATTTTATCCGTATCAA 
GGGAAGTCAATCCCACAAGCTGCTTTGCTCAAAAACACTANTTTTGNTGTGTTCCTCAACAT 
CGCATTGGGCACAACTGGTTTAGCAGCAGCA 

SEQIDN0217 

AATGAATATCTCCATGACTAGAAAATTGTAGACATGACACATTCTTTTCTTCTGCTTTGCAG 
GCTCAGTGAAGCTTTCATTTGGCTAGATTCGTGGATTTTGTTATCAGT 

SEQIDN0218 

CAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAGGAAATAGACCATATGATGAT 
CCATTCTTGGCATGTTGTTGTTGTCCTTGTTTTGTAGTTTCTTCTACTTTCTCTGTG 

SEQIDN0219 

GNAGGTTCATATCTCAAGGTCAAAGAAGAAGATGGGTCGTCATCATCATCATCATAATGAAG 
GAAATAGACCATATGATGATCCATTCTTGGCATGTTGTTGNTGTCCTTGTTTTGTAGTTTCT 

TCTACTTTCTCTGTG 
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SEQIDNO220 

GGAAATACCCCACGCTTCAGCAGATCAAGGAANCCTTCCTGAATCAGCTACTTCTGCTGCAC 
ATGAGGCATCTATTCGCGAATTTGCNGAGGCTGTCCGTGCTTATCGAGCTATTTTCCATGAT 
TCAGAACAGCAACTCTCTAGACTTGCACAAAATGTACCTAAAATGCATTTCGAAGCNGCCCA 
GCAGCACATCAAGAAACGACTTGCTTCTTCCAATCTTGTTGCCATG 

SEQIDN0221 

GAAAAGCAAAAATAAGNCGACATGGGGTTTATTTGGAAGAAGAAGATTGGGACTTTGAATAA 
TAAGAAGCGTAAGTGGGAGGAGAATAGATCACGGCGGCAGCCATAGCTCCAGGGAACACTAC 
ATATTTAGCTATCAGCGCCACCTTCCTCACCAATGGCTNTCCCTTCATCAACGCCATTTGC 

SEQIDN0222 

NCTNNAAAGGCCACTCAAGTTCCTGATATAAAANGACATAGTATTTGCACATCACCTTTGCA 
CATCCTTCCATTTTCTTCATATCAACTCTAGATCACTCATAATATCCGATGTAAATGTCATG 
CAAATAGTTATTGTACTATATTGTGTAAGGAATAAGGACAAGAT^AAAAGTCTGTACATGTTC 
AGTACAGACGCAATTTTTTTTTCCAATATTTCCAATCCTTGGTTGCC 

SEQIDN0223 

TTTGCANACAGTCCCNTACTTTCCCGACTACTTNCAATAGGATNCTGAAGATGCCTTTGATT 
CATCTTCTGACATTCTAGCCTATCCGTGGNCAATCAACACAAAGTATTATAATGCTGATGGT 
TCTATTTGGATGGCCCATCTTNATGAGGACTTNTCCATTGGAGCTTTGCCAGCATTTGACCA 
CCTTATTGCATTGGTGTTGGTCTTCGATATTAGTGATCTCTCATCTTTTGCTGCGCTGAAAG 
ATTGGGTTTCTCGCACCGACATCC 

SEQIDN0224 

ACCCAAATCCTCCTCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTT 
GGTTGCAAATGAAAGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTGAAACAC 
C T GC T C AG C C C AC AC C AAAG AG AT C G G T G C C AAT T AC C G AAAAT AAG T AC AAG AGT AT G AC A 
TGCCAAAACTCTGTTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGG 
TGGTGAAGAGATATCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATA 
ATGCCC 

SEQIDN02 2 5 

TACCGGATGANTTTGTNGATGATGANCATTCCCAGTTTTTGNTTGCACCTGCTNTTTTGGCC 
AGNTTTCNATTGAGAGAGAGGAGACGCNAAAAGCATGCTGCTGNTTTAGNGAAACAAGATGA 
TGAGGNAACCGTNAAGCTTGAAAATGCTGCCCTTGAACGCTCTAAGTCAGTTGACTCTGCTG 
AGCTGGGGAAGTATAGCATATGGNGGAAAGAGAATGAANATGAGAATACTGATTCAAAGGNA 
CNCTTGANGCGGGACCAAATGGNTACNGNAAGGCTGTNTATAANCATNGCAACAATGAAGAA 
AAAGATNGGCTTGGCTCAAGAGTTAGAGAATCGGC" 

SEQIDN0226 

TTTCTTGCNTACAGGATNTCAAGGCATTGTTCCCCTCTGATAGAAATCCCTTTTATGCTGGT 
TTTGGAANNNGAGACACCGACAAGCTCAGCTACCTCAAGGTTGGAATACCTGAAGGAAAAAT 
C T T C AC CAT C G AT C C AAAG G G T C AAAT T C T TAT G AAC C AC C AC AT AG AT AC AAAAT CAT AC A 
CCTACATACATGGTTGCGTCGATGACATGTTTCCACCCTTGTCCTCACGTGAGCAGATTTGT 
TATGGACAAAGTGTTACTATTACTAAGAGGAGGTATTTTCGCCGTTGATGGGTAAATGACTG 
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GAGAGTGATGCTACTGATGGATTCATCGTTGCGAATGACACACATACTGGAATTGTGATGAC 
CTGCATTTGAAGAAAATTCTGTCTCTGAACCTTTAGTAGGGGATATGGTCTCTGTATAACTG 
GTTTTGTCAAGAAATGTCCACC 

SEQIDN0227 

TATCAGNCGAATCTAATTTTGTACCCGGTGGATTGTTATGTGGTCCTCAATGNTNAAGNAAT 
ATGNNCGNTNTTGT 

SEQIDN0228 

TTATGGCTTTCACTCCAAGAAACCCTCGTGCTGGA7\AGCCACCTGATCATTACATAGAATAC 
ATGCGC 

SEQIDN0229 

CTCCTATACCTGANAGGACTCTCACTTTGGAGCCTCCTCTAGGTATNTAATTCCGNCCTTTG 
NNGGGAGACTGACATCGGNTACATGCAGCGNNGCTTCAACTGTAAATATTCCACCTTNATGG 
CCTTGCCNTCGT 

SEQIDNO230 

CACCCAGAAAATNATCTATAAAGTATTATGATCCAGGACGAGCTGACTAAACTAGCTGATGA 
GGAAGATGACGAGGAAGAGGAAGGCGATGCTGAGT^GGATGTAAAAAAGCCTTCTGGCAAAG 
GTGTGAAGGCCTGAAAANACATGGGNAGANTGTNANCACANNANAGGCCCNCNACNCTATCN 
ATCAATATCCAACCTTTCTCTTCCTCGTGAATTTGTGCCTTGTGAGTTCAACCAGCTGTAAT 
CTATTC 

SEQIDN0231 

GANNCCNTTNNCTTNCTAANTAANNGCAAAAATAAGCGACATGGGGTTTATNTGGAAGAAGA 
AGANTGGGACTTTGAATAATANGAAGCGTNAGTGGGAGGAGAATAGATCACGGCGGCNGCCA 
TAGCTNCAGGGAACACTACATATTTAGCTATCAGCGCCACCTTNCTNACCAATGGCTCTCCC 
TTCATCAACGCCATTTGC 

SEQIDN0232 

GCCTTGANTTTGCGCNCCNAACAGGATGATCTTGTAGATGATGAT CATTCCCAGTTTTTGGA 
TGCACCTGCTTTTTTGGCCAGAAGGCAATTGAGAGAGAGGAGACGCGAAAAGCATGCTGCTG 
CTTTAGTGAAACAAGATGATNAGGTAACCGTNAAGCTTGAAAATGCTGCCCTNGAACGCTCT 
AACTANGCTGGCTTTTTTGTGCTGNACCAGAATCCCTTATGNlSIGNACAlNllSINANTGATANTGA 
GAANTCNNANTCAAAGGTACGCTTGATGCGNGACCAAATGGTTACGGCAAGGCTGTATATAA 
GCATTGCAACAATGAAGAAAAAGATTGGCTTGGCTCAAGAGTTACAGAATCNGC 

SEQIDN0233 

TTGGACTTGAGTGCTTGTAGATGGTGCCTTTGCCGATACCCACGGCATCCGCAATCATCTCG 
ACGGTGACGCTGTCTTCACCCTGGTCGAGGAACAGCTTGAGTGCGGTGTCGAGAATTTCCTG 
CTCGCGGCGGCGAAACTCACGGACCTTGCGGGGTTCTTTGTGCATAAGAAAAAGGTCTGCAG 
AGGTGGATATTGGGAGGGTGGCGTGCCCCAAGCCGTGCTTGTCGGAACGTTTTACCCGGTGC 
GCGGGATTATCCCGACTGAACGGTCGTTGGGCAACGCCTATATGAACAACCTTTGCAC 

SEQIDN0234 

TCAACCTGAACAATGTTGAAGAGAGCCAACCTCTACCAGGTTGTAGTTTGGTTGCAAATGAA 
AGAACTCCTATGAAGCTCCTGTCTGAAAGTGAAGTAATGCTTG7\AACACCTGCTCAGCCCAC 
ACCAAAGAGATCGGTGCCAATTACCGAAAATAAGTACAAGAGTATGACATGCCAAAACTCTG 
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TTGTTTCCAATCTAATTGTCAAAAGGTCATTGGATTTTTCCACCTTGGGTGfeTGAAGAGATA 
TCTTCAGATTTGAGTTCTGGCAGTATAGAGCATCATGAAGATGTAGATAATGCCC 



SEQIDN0235 

TCATGTGCATTTTGACTTTGGAGACTCAACACAGGGGTTGGGTCTGTCTAGGACAGGTGCAC 
CTGAAATGAAAAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTATTCAAGGGTA 
AAAAGGTCATTTGGCGGACCAATGATAGTTGAGGGCAAGTGAAAAAATGAAAAAAATGAAAA 
AAGGGAAAAAGAGAGGGTGAAGTGTGAGGATAAAGCGAGCGGGGCCTAATTAGGTTATCTGT 
TACATTTTTGT 

SEQIDN0236 

TCACGTGCATCTGACTTTGGGGACTCAACACAGGGGTTGGGTCCGTCTAGGACAGGTGTACC 
TAAAATAACAGACCATCTTGATGCATCCTATGTGCTACATGTTGCATTTCTTCAAGGGCAAA 
AGGGTCATTTGGCGGACCAATGATAGTTGAGGGAAAATGAAAAAGAAAAGAAAANGAGGTTG 
AAGTGTAAAGATAAAGCGAGTGGGGCCCGATTATATTTTNTGTCACATTCTTG 

SEQIDN0237 

AGCAGNCANGGATNAAATGGGAAAAACNTGTCAAGCCTANCTCTACCACCAAAAGAGANTGA 
AAGATCNGACTNGAGCACACCACTNGATACATAGGTATCAGGCACATAGAAGATTAGTTACT 
GTTTGCCAACCGAAGAAATTCTTTCACTACTGATGGCAAGCATACCA 

SEQIDN0238 

AGAGGTATCCTTCTAGTGTGGATTTTGATACTGGGGTTGATGATTTGACAAACCCGAAGTGG 
TATGTGGTGTGGTCTGCAAATATGAACACTCACATTCTTCCCGAATGCGTAGTTAGCTACAA 
ATATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATG 
CTTCAAATGCAATGGGTACA 

SEQIDN0239 

GAGNGGCAAGTCCTGGCGCTCTATTTCCGAGAGNAGAAAAAAGAATTTTGTTTTTGCT 
SEQIDNO240 

GGGTNCAGGCNTCTGGCAANTCCTCGGTGTCANNGNTACACT^ACTGGAGATGGAAAGAAAGT 
TTATGTCGCCAAAAATGGGCAGGAGTTTTCTGGTCAAAGCGCATATAGATGTTACAGAAAGG 
AGACTGGAGCTGGTT 

SEQIDN0241 

GGTTCTATCTAAANNATTTAATGACTTGCAGCTGAAGGATGGTNTGTACACTAGCAAAGCTG 
AACTGCGGAAACGTATCAGGAAACTCAAAAATGGGCCAGGAAGAAATCACACTGCAGGTGGG 
AGGGTTGAAAAGTCTAAAGGTTTTTCTCCCAATAGCTTTGATCGTGTGCTCCTTGATGCTCC 
TTGCTCTGCATTAGGT 

SEQIDN0242 

TANANTACGACCGAGAACATCCTNATCAAATGCTCTCAAGCTGTTCTTCATCAAAACCTTTT 
TTCGGGTTCATCTTCAGTACCATTACAACAATCTTTGATTGAAGCTGCGGGTAATGTTGTAC 
AACGGGCTTNGAACTCGGCCCAGCCCGAACATCAGCCCAATGTGGTAAAAAATCCGAAAAAA 
CGAACCAGGGCATCAAGGAGAGCGCCAACTACTGTCCTTACTACTGACACCACAAATTTTCG 
ACAAATGGTTCAAGAATTCACTGGCATCCCTACAGCTCCGTTTACTGGTTCAGCCTACACTC 
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GCCGCCTTGATCTTTTTTCTACAGCTGGCTCAGCGATGAGGTCGGGTCATTTGGATACTCTT 
GGGCCACTTTACCCT 



SEQIDN024 3 

AAGACAGGGATGGCAGTGCTGAGAGNAGGGCAAAGATTGAGCAATGGAATAGGGAAAAAGAA 
GAGGCAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGAGTGATGGTGGTGA 
TCACTATGGAGAACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGAAGTT 
ATGGCTACTGATAGTAGTGTTACTCTGGGTGGAGTACAGGTCCACTGTGCTGTGATTTTGAA 
AAAAGCATAACCCTTCTATTGTCTTCTTTTTACCATGT 

SEQIDN024 4 

GGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAACATTGACATCTGTTCAAACATCTG 

AAAT T GAAG AT T G G AC AAAAT AC AAGG AT G ATG AT AT TAT G C AAC AG C AAT C T T C CAT C C AG 
GCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAAGGAACCTTTGTCTTCATTAGAAGC 
TGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAATAAAGGTGCTGAGTGAACAATATG 
CTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTTTTCCGCAGTTTCATGTTTGGTTAC 

CTTGTATGC 
SEQIDN024 5 

CCGNCAAACAAAGTAAAAGATGCAGGATCAGGAAGGGCATGTGGCTGATGCAGGAAAAGAAA 
CATTGACATCTGTTCAAACATCTGAAATTGAAGATTGGACAAAATACAAGGATGATGATATT 
ATGCAACAGCAATCTTCCATCCAGGCTGAACAAGCTGTAAAAACTCAATTTGTTGGCGATAA 
GGAACCTTTGTCTTCATTAGAAGCTGAATACCATCTGGGAAATTCAATTTTGCTGGAGAAAA 
TAAAGGTGCTGAGTGAACAATATGCTGCCCTTAGAAGAACACGTGGAGATGGAAATTGCTTT 
TTCCGCAGTTTCATGTTTGGTTACCTTGAGCACATTCTGGAATCACAAGAATCAAAGCGAAG 

TTCATCGCA 
SEQIDN02 4 6 

GCAAACCTGAAAGAAANGANTGGCAATGATATTTNNTCTGATGGCNAAGGTGAANCCAGAGA 
TTACTTTGGTGGCGTGCGCAAACCACCAGGTGGAGAGAGCAGCATTGCACTAGTTTAGATGA 

TG 

SEQIDN0247 

GAATTTTAGATTTCAATGGCTCTCAATGAGTTACACGGAATCAAGNTCTAAAGTACCTTTGC 
GGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGTTGG 

SEQIDN0248 

CAGGGTGCTTTGTGACATATCCCTGCACTGATCACCCAGGTGACCTAACTCTGGTCTAAGCT 
CTGCCTAAAGGGGCATTGTGACAGATCTCTGCACTGATCACTCAGGTGATGTAACTATTGTC 

TAGGCTCTGC 
SEQIDN024 9 

GAAGAGTGGACTCTTTATGAGCAGGTAGCTGTTGCAGCTATGGATTGTCAGTCTCTTGATGT 
GGCAAAGGACTGCATAAAGGTATTGCAAAAGAAGTTTCCAGGGAGCAAAAGGGTTGGTAGGC 
TAGAAGCTATGTTGCTAGAGGCCAGAGGATTGTGGTCAGAGGCAGAAAATGCTTACTCAAGC 
CTTTTGGAGGAAAATCCCTTTGATCAGGTTGTACATAAGAGGAGGGCAGCCATGGCAAAGGC 

GCAAGGCAATACGTCAGCAGCAATTGACTGGC 
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SEQIDNO250 

TTCTGCTGGAAAGTACTATGATGGTAGATTTGATGAACCCCAACAACAATATTTTTTGGATG 
CTTGTTTCCTTTGTAAGAAACGCCTTGCAG 



SEQIDN0251 

AAACTAATATACGAGTTGTGTCTGCATCTTCCTCAACTGGTTTCATAGATTGATCAATTGTG 
GCACCGGCAATGTGGTGGCGAGGTTTACATCCTCGAATTGTTCCACCGGTCAAAAGATGACC 
TGCTTTTCTGAAATGAATTCTGTTTCCCAAGCAG 

SEQIDN0252 

GCACATTTGAGAGCNCNNGCGCANTGNCATNTCTTNAGCAGNGGAAGAGTAANTTCTAGATG 
TAAATACCCTGCTTTCCCGTAAGAACTGGTTTATATTGAAAGCAGAAATGCCTCTGCTGGCC 
AATTTTCGACTTATAATTCCAGACATATCCACTTCCTC 

SEQIDN0253 

AGCAAGTGAAGGATTGGTTTCTGTTCATGGTGACGCTGTGAAGAGAATATGATGAGTCTAAA 
TCAGGAGTGAGGCATTCTCAANTTCATTGCTCAGGGAGCAGAAGTTGATATGTAGATTGCTA 
CTATTTGCAAGAGCACTTTCCTGGCATGTTAGTCANAGTCATGTTTTTTGTTCCTGATCAGC 
AGTCTTCTTTNACTATCTGCCCTTTGNAAGAGTTAGCCATACGTTAGAGCAATGTGTTCTTT 
TCAATGTTGGATATTTATTTGAACTTGATC 

SEQIDN0254 

AGTANGCGTAGGGAAGACAGGGATGGCANTGCTGAGAGGAGGGCAAAGATTGAGCNATGG7\A 
TAGGG7\AAAAGAAGAGGCAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGA 
GTGATGGTGGTGATCACTATGGAGAACAGCTNGATGANGATTACCCNANGCAGCANTAGGTA 
NCAAGATGGAAGTTATGGCTNCTGATANTANCGTTACTNTGGNNGGAGTACANGNCCANTGN 
NCTGCAGATTTTGNANANAGCATANCCCTTCTATTGGATTCTTTTTACCANGT 

SEQIDN0255 

AAGCTAACAGAATCGTTTGTGGAATAAGGGTGTCGATTCCGAGAGCTTCCACCCCCGTTATC 
GGNCTCATGAAATGCGACTAAGACTAAGTAATGGAGAATCCGATAAACCTTTGATAGTCCAT 
GTTGGACGACTTGGAGTTGAGAAGAGTTTGGATTTCCTCAAAAGGGTCATGGATAGACTTCC 
AGATGCTCGCATTGCTTTTATTGGAGATGGGCCATACAGGGAGGAATTGGAGAAAATGTTCC 
ATGGCATGCCTGCCGNGTTCACAGGTATGTTACTAGGAGAGGAGCTTTCCCAAGCATATNCC 
ANCGGNGATGTTNTTCTTATNCCTTNANAGTNAGAGACACTGGGGCTCGTCGCTTTGGAGGC 
CATGTCATCAGGGCTTCCTGTAGTANCTGCCCGTGCCGG 

SEQIDN0256 

CGNTTTNTNCTCGGNGNGTCAGCTNNGNGGANGCNCTGGGTGCTGGTTCNNAGGNCTNATGA 
AACGCTCNAAGGCAACAATCTGGTTATGACAACTGCGGGAAAAATTCCCTTTCTGCGCAAAC 
TCTCAAAGCGATGGACTNGCNATCANAGCAANTTTTGCATCTGC 

SEQIDN0257 

ATGANGANGAGGATGAAGAAGATTATAAGCCACCACCTAGGAAGCAATCTGATAATTCTGAT 
GAAGATGCGGAGTCTTTTCCGTTGAAACGAAAGCTATCTCCGAAAGAAGAGCCTGAGCCAAA 
AAGGTTGCAGCGGATTGCTAAAGGCTCAAAGTCTCGAGACGGTGTTTTCGCTGCTTTGTGCT 
CAACC 
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SEQIDN0258 

ATGATCAAGCCCTAGAATTCGCGAAGATGCTCGATCAATCGGGAACTGTAATTGTTTTGGGA 
AATATCGTATTCCTGAAGCCTGACCAGGTGGTGAAAGCCATGAAAGGCCTAATGCCAATGCC 
CTTGGCCGAACCAAATGACCCAAAAATGATGAAGGAGCTTCAACAAATGGAGGAGAAGAAAG 
CAGCAATTGACAAGAAGGCAGAATCATTGGTGCGGACAGAGTTGTGGCGTGGACTAGGTTAC 
TTTGTGATTCAGACTTCAGCTTTCATGAGGCTCACTTTCTGGGAGTTATCATGGGATGTAAT 
GGAGCCTATTTGCTTCTATGTCACATCCATTTACTGCATGGCTGGGTATGCTTTCTTCCTTA 
GGACCTCCAAAGAACCTTCTTTTGAAGGGTTTTTCCAGAGCCGGTTTAGTGCAAAGCAAAAG 
CGATTGATGAAGCTTCATAAATTTGGATCTTCATAGGGACCAAGAGCTCCACAGAGCTTGNG 
ATCCTCATTCGACGATACCTGGTGGAAACACC 

SEQIDN02 59 

CAGATGGTACTGTAAACATGTATGTTCATCATGAGATTATTATTCCTGCGNTTCCTGTCTGC 
AC AG CAT G GAT C GAT T G C C C T A 

SEQIDNO260 

TTGNTTACTCNGCCCTTGNATTTCAATGNGCTAATCCATTANCCCNCACGGAATGACGNTCT 
AAAGTACCTTTGCGGATGCGAGTTTGCTAGAGGCTGGTCTCTAATGATGG 

SEQIDN02 61 

CAATAANTTTATTTGGAGGCTTTCCTTCCCTGCCTGGTTTGATGTCAATGACCTATCTGAAA 
ATGCTATTGATGATGATGAGGGTTTAGATGCTTCAGCAGCATATGTGGCGAGTTTGTTGGCT 
ACGGAGCCCCCTCACATCAAACTTGGGGTTGGAGGCTTCAGCATGGGCGCAGCGACATCTCT 
TTATTCTGCAACTTGTTTCACTCGTGGGAAGTATGAGAATGGCAACTCGTACTCTGCCAATC 
TGAGTGCAGCTGTTGGA 

SEQIDN02 62 

GGATCAGGTTNTAGCAGATACACTATAATCANAGTTGNNGTGGTCATGGGGCATGGNTATAT 
TTGGNGGAAGGGGTGGAAGCTTNCCGAA 

SEQIDN02 63 

GTGTCCCAGCAAGGATTACCCAGGTGATGTACCTCTCATCAAGGCTCTGCCTACAGGCACAT 
TGTGATGTATCTCTGCACTGATCACCTAGGTCATGTAACTTTTNTCTAGGCTCTACCTACGA 
TGGCATTGTGACATATCTCTGCACTAATCATCCAAGTGATGTAACTCTTGTCTAGGATGTGC 
CTAAA 

SEQIDN0264 

CCGNTACTCTCCGCTNGACCAGNTCGTTTNCTTCCCCTTTTTCAGGCTGGTGACACACTANT 
ACAGTCAGTANGACAACTTCATCACTGATTTTGAGACAAAGATCAATCTTNTCAAGCTTGCN 
CATTTTGCGGTCATTNNTTCTCNGGAANACCCNGANAAAGAGGCTGNTATAGGTTACCTTGA 
AGGAGAGACTGAGAAACTTCNNNATACTAAGGAGACACNGATAAAGGAGCCGATTCTTTATA 
T 

SEQIDN0265 
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GGCTGTTAGTGGCTCAAAAATTGTTGGCTCAGCCAAAGCAGAATCCATTGAAAGTGGTGAAA 
GGACTCGTCACATGCAGCCTACACTTNCGAATAGTCCACACCCTTCTCTTTCTTGCAATGCT 
GTTGTATATTCTGCATATGAAGCATCCAAGGACGAAGTAACCCAAAATAATGCACCAGCTAC 
TGATGATTGTGGATTCTTCGAGTCAGGCTATATGCTTGCGAACGGGACAGGGCCTCCTATTG 
G AG AAAG C AAC TAT G AC G AAG C T G T T G AAT T T GAT C C AA 

SEQIDN0266 

ACAAATGGTTACAGATGTTATGGAAAATCTTGTCAAGAGGGCTATAATGGCTGAATCTGAAA 
CTGCTTTAGAGAAGGAGAAGGTAACAATAGGTCNTGAAGAGATTCAAAGAAAGGCGCTTCAG 
ATTGAAAACATGTCAGGTAAGTTAGAAGAGATGGAAAGGTTTGCTTTGGGTACAAATTGTAT 
CTTGAATGAGATGCGCCAGAGAGTTGAAGATTTGGTCGAAGAAACTTCTAGACAGAGGCAGC 
GAGCTNCAGAAAATGAGCAGGAGCTTTCTCGTG 

SEQIDN0267 

GNNTNTGGANGCTGNACATNTCATCCTCANCNCAGGCCTANNCTTAGNNCNAGGNGCCNNCC 
ATNNTNCAGNTNNCTCTTNCCGNNATTCTANTNATTCGTGCACATGNNGAAACCTATGCTNT 
TGCGNCNGCTNNANGNACANTCANNNCTGCANNGNCNGANCCTTCNTGCNCANCNTAATCAA 
CCTTNCAACNGCATGATGACTCTTCATGCATAGCCATATGNTATCTTCATTACGGGCTTTTT 
CAGACATACCGCTTCGTTAGCAGGCATCTTACCC 

SEQIDN0268 

GATATTCGTAGGGCGAGGACTGTTATCTTACAAAGGATCATCAAACCCCCAAACCACTAAAG 
TGCTGAAATTTGCCTTAGCAGCAGNGAACATTTATCTGCTTTTCATAGTTTGTGATG 

SEQIDN02 69 

GGGTCAATACTCTGTCTTCACTGCGATCGATATTTCGCGAATGTTGCGGTGAGGGACGAGCA 
TTTCAAGACGAAAAAGCACAGGAAGCGTGTGAAAATAATGATGGGCCCTGCACCACACACCC 
AACTTGATGCTGATTTAGCTGCTGGAATTGGCATGCCAGATAATGGTCCAAAGCTAATGTCG 
ATGAGTTGAGCTTCTTTCGTCCTGTTTATAACTCCTACATTACTGGTAGAGTTCTTTTGAAC 
TTTGAGAATTTGTCTGAGGAACATAGGTTTTTGTTAGTCTACCATCTCTCTCTCAGTATAGC 
AAGT 

SEQIDNO2 70 

TGATGACCTTTNNGNATCTNGTAATATNTGAGAACAATCCAAACGTTGAGAGCTGCAGCAAT 
TGATCAAGTTACCCTCTTNGAAGAACAGAAGATATTAGCTACAGAACAAGCACAGATGGTGA 
AGAAGCTTGGTGATTCAGAAACG7\AGACTGCAATGCTCAAGTCACAGGCTGAAAGTTTAGCA 
AATTACTGTGATGATGTGGCCAGCACTAATAAAACACGAGCGCTGCAGAAGGGAGTCTGCAA 
GTATAGTTCCTATTTTTTGATACAGNTGGTATTGCTGGTTATCGTCTTTGGACTGTATGTTT 
TGCAGATGTCACCTGATGCTGTTGAAGTTGTACCGACATAATTTTGAGAAGTGAGCCTTTTT 
CCTTTTTCTTGTATTTTCAACATAAAGCAACGATGAACG 

SEQIDN0271 

AGGGTTATTCGGGTCGGACCTGGCGAATGCAATTGCTAAAGATACAACAATTTTTGATCGAG 
GTTTAGATACNCATTTGAGACCTACCATTGATTGTCTTAGGAAAACTTTGGGCACCGATGAA 
AATGTAG 

SEQIDN0272 

CACTCAAANTCCNGNCAGAATCCGGNGAANTTTTCGGCGAGACATTCCAGTAGAGTTCTTGT 
CCGAGGTTTTGACATTTCAGATTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 
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SEQIDNQ273 

ATTGGCCGGTCTTGGACTTCAAAAAACNNTCNCAGAGCTTCGAAGTATCACTTCTAAACCTC 
AATCGGAGAAGAAAAAATATAACANAGTTGACTATTTCTCTACTCCTTTGCGCCGTTCCGAT 
CGATTGAAAGGCAACACCCCTCCCGAATCAGAATTGCGCCGTTCGGGTCGCTTGAATGAGAA 
GTCCTGCTACTCTGCTCCACCAGCAAAAAGGAAATTGGGGCTTTTTGAAGAAGGAGATGTTG 
AAGAAGATAATGAGAAGAGACCTGCTAATGCACCTCTCCTGAGAGTGAAAGATGGC 

SEQIDN0274 

GCCNGCTGTGNNCTGCAGTTGTTGTAAAGGTTGAAGTAGCTCTAGACAAAAGCATTTGCATG 
TTGACCAGATGAGCAGAACTGATGTTATTTGCAGTAGAAGGAGGAGGTTTCTTCTCGTCTTC 
AGCTTCTGGATATAGTAAGGGCCTGACCCTTCTACTCTTGGGTCAGAAGAACGAAGAGAAGC 
CCATGAGAGTTGCACCGTGGAANCAGTACCAGTTGGTGGACCAAGAAACTGATCCGGACCTC 
CAGCTGGCTTCCGGGAAGAACAGGGTTGTCCGCGGGTGCGCCTCCTTTGTATGCTTTGGTCG 
CGCTGCCGCTGGACTTGAGAGCCCATCTCCCC 

SEQIDN0275 

CGTTCTNCTGGATNGTTCCTGGCTATATTATGGGAGGGGAAAACAGGAACAAAGAGAAAGCA 
AGATTGCGAAAAGGTATATCTATTCTTGTTGCAACTCCTGGACGTCTTTTGGATCACCTAAA 
AAACACATCATCATTCTTGTACACGAACCTGCNCTGGATTVATTTTTGATGAAGCAGACAGAA 
TTCTGGAACTTGGATATGGTAAAGAGATTGAAGANATAC 

SEQIDN0276 

TCCAGGATGATGGCACTCCTGTCTCAATATTTGCACTTACGGGGAGTAATGCAAACGATGGA 
CATTTAGCTGCTGGCCGAAATGGAGTCAAGCGACTTCGCACTGTTAGGCATCCAAATATTTT 
GTCATTTCTTCACAGCACCGAAGCAGAAAATTTTGATGGTTCTACTACCAAGGTTACCATCT 
ATATTGTTACTGAACCTGTCATGCCACTCTCGGAGAAGCTAAAGGAATTAGGA 

SEQIDN0277 

ATGNGCAAATTTGCGATCCNAGCGTCAGATGAATCCATTACCCAGGAGATTGCTTCANATTT 
TCAGGGNTGGNTGNATGATCTAACTGATGGTGGTGTTGAGTACATGCCTGAAGANNAAGTAA 
AGGNGGCTGCTGCTGAAAAGCTAAAGATTTCAATGGAACGGATAGCATTACTAAAGGCGGCA 
AGACCTCCCCGAAGTCTCCAAAATCTGATGATGAAGAAGAAGAGGAGGAAGACGAGGATGAT 
GAGAACCAAAAGAAAGAAGACATGA 

SEQIDN0278 

TTGTCTAAGATAAAAAATGTAATAGTAAAGAGAGCTGCAGATGAAGACATGGAAACTGCTTC 
TATGTTGCTTAGGTGTTGCTATAATTTTTATAAGGACACTTTTTGTGCATTGCTCCCATCAG 
G T N T AAAC C T T TAT AT G G T G C CAT C T C AAT T T G C T AC AG AAA CAT AT AT CC AAC C T G GG AT A 
GATGCAGTTGACATACTCGATATGAACACTTCACGGAAGCTACTTTTGTGGGCCTACACACT 
TCTGCATGGCCATTGCACAAATGTCTCAGCTGCTA 

SEQIDN0279 

GCTTTCTTGCCTGCCGTAGACACAGTGNGAAGGGNGAGTGCCTACATGAATGNTTTAGAGTG 
AACCCTGATGGTGTCAAAGACAAAATTAGCTGTGGTGAGNTTCTGGATNTGACTCTNGAGGA 
TGNCGATAAATGCATAGAGCTTATTTNTACGCCGATCCGCAAAGATGCA 

SEQIDNO280 

GCGATACGAGGCGAAAAAACTAAGCTTCCGGAGAGTGTGAAAGCAGATNCCCTTACTAATGA 
AGCTTTTCTTGACCGGGGGTTTACTCGCCCCAAGGTTCTGATCATTCTCCCTCTAGCAAGTG 
TTGCATTTCGAGTAGTCAAGCGGCTGATTGATTTGACACCTCCTAAATACAAGTCTAATGTA 
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GAGGAGCGTGAACGTTTCTATAGAGAATTCGGGGCCGGAGTAAGCAAAGATAGGGAGGATGA 
AGATGCCGTCGAAAGCTCTGAATCAAAGAAGAGCTCTUVAACCATCTGATTTTCAAGCATTAT 
TTGGGGGAAATAACAATGATCACTTCATGCTAGGAA 

SEQIDN02 81 

GCGGCATGTGAAAATCAACTGNTTGTGATATCCCACCTACTGGAC 
SEQIDN0282 

GNGTACGGGGNCCGGGCATAGATATGCCTGNANGGAGTNNGACAAAGCTTGCAGAGTGGNTC 
ATCCTTGTCAGACCACCCCTGCATGTATATNTTCTNTTGNTTNCCTNTCCCAGTACAAAGAT 
GGACCTTACTCCAGACAGCGTATGGTGGTAACGGATAGCTAATTNAGTGCANAGGTGTTGNC 
CTCCTCTTACTTATACCTTTCAGCAGTCCCCCATTATCGTGG 

SEQIDN0283 

GCTNACTNACATAATAATNANNCCNGAAAANTAAAACTTCTTTTNAATTATAATCATAAGCT 
CTACTCGGAGATGTGAACAGCGAGTTTTAGGTGGACTTNTGAAAGAATGCCTCGATTCGTNG 
TGNTCCAGAAGGAAGCGGCTTCTCTGTTGATAATCGAGGACGATTTTGAACCTTAGGAGAAG 
GATCANACGGCTGTGAAGGCACGGGAAGCGAGTCGAGAAGGAAATCGTTCGTAGGTTGATGC 
CTTTTCACAGCAACTC 

Group 4 
SEQIDN02 8 4 

GGCCATCGGAGCAAAAGAGAGCAACTTACATTCTTGAACTACGTGAAGAATCTTCAGAACCC 
T 

SEQIDN0285 

GAAGGTGAAAATGGATATNGCGATATCTCAAAGGCACNTNAGGTATGGCACTTTGTT 
SEQIDN028 6 

CTGNTCGAATGGGATATGCATATTCATATGTCCTATTGTACTAATCAGAGTTTCAAGATTCT 
GGCTT 

SEQIDN0287 

TGGATAGGTNAGCNANGAGCANACGANANNCCTGACNGGGAAAGGGATGCANTCAGACTCTC 
ACTGGCTTCAGCAATTCTT 

SEQIDN02 8 

TTGAAATANCNNNTGNNT^ANNCTNACATTAGCCNCTCTGTTGTGAGGAAAGGCCTATTCCCC 
CTCTCTATGTACTTCATTTCTGNCATACAT 

SEQIDN0289 

TGATNNATGCTCTNTAATTGCCATACTCATTGGTAATTGTGTTGATGNGCCTTNATAACGGG 
TTATNATGGCCTNCTCTCTTCTATTAGCGCCAAATGTAGGAAAGTCATTAGTTTGTGTTTAG 
TTCAGGAACAGACATATTTCAGCCGTGCCACCGGACATCGCATGATGTCAAACTCTGNGAAC 
TAATCTCACTAGAGACGAGAAGACNATGGCCCGCTAGT 

SEQIDNO290 

GCAGCAGAAGANATGAACCGAAATGAAGGCCTGAGTTCGGCCCCAAACAGCCGATTCAACAA 
C AG AAAT C AAT G C AC AG AT T C AAT C T CG AG C AG AAT G T 
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SEQIDN0291 

GGCATGANAGGAACATTCACNCGTATGAGCACGCATGTTGCAGANTCTCCTTCGNGGGGCTG 
NTCCAAANATTCACCACTATGTTAGCCCAGGAAATTCNCCTCCCCNTGATNCTTCTGCTCTG 
CAGT 

SEQIDN0292 

NTCTGTGCCGGCTCNANTNNGGATACTACAGCCGAAACCCTANCGAGCGTATNNNNNAAGTG 
CGCAAGAGATTGACAGATTGTAANGCTGTTACNGAGAATGCTGNGTAGGGAAGTCCATAANG 
ACCGCGTGATTACTATGT 

SEQIDN0293 

NGGAGTAGTAATACCCGTGTGGATAGTACCAAACTCAATTACTTTAGGAGGGTATGTTGCTC 
AACCTACCAACTGGC 

SEQIDN0294 

CTTTNGGNAGTCCG7VACNCCCTCNNNGANAGACCAAANNGATGCGNNNNGCTCNTGCAAAGG 
GTGAGGANCNNNATNNTNGCC 

SEQIDN0295 

TTGCAGAATTGATGTGGTTGCTTTGCTCTAAAAGTTGGAACT 
SEQIDN029 

TTTGAAG1SICCTTTNANCNNCNCTNANAGGGGCTGNNGNTGGACGCANCACACGATTCACATT 
CTNCNCCTTAGNCGAACGTGGTGTTCGGAACAGTTTACATCACT 

SEQIDN0297 

ANCCCANGGTTANATGGNG7^ATCACACGATNACANANCTTCTCCTNAGCCGACGCCTGTAGG 
GAACAGCATACTCACT 

SEQIDN0298 

GNTTAGNNANCCNNCGGTNNGNGATNGGATGNNGNTNAGGGNCTGNTTCAATCCTGTATAGN 
GACTCTTTNTTACCCGTTGTGTTCCNCT 

SEQIDN0299 

CCTAGANAGCGNGCTCCNGAAGAGAATAAGGCAATNGCCAAAGTTGCAAAAGTTCATGCCCC 
TNCGTTAGCAGNTTGGATCAATTGGCACAGGAGGGCCTCAGCTNTGCCTCGAAGATCTAAAG 
CTTTAC 

SEQIDNO300 

GGCCCTGACGTCTCCTCTATATTTTATTTCCTATTTCATCTTTTTTGCTTCAGAAACAATGT 
NTCCTTTTATTCTCGGACCTTGTATTTAGCAGTCTTAGAACGTCGGTGACATTGTGACACTA 
GGTTTTGGGTGATTATGGC 

SEQIDNO301 

NTTNTAACATCACGCATGCATAACAAACTGTCAATTGGTGTGAATATTCAGAAGTCTCTTAT 
TCATATCAATNCTCAGGGGGAATATNACNACTCTCCAGGAAAAAGACGTTTCANANACGGAC 
AGCTGCNAAGAGATGCAGTATGACAAGAAATTCATTCCTCTTCCTCCGCCTCCTCCAGCCAT 
TTCACAAAGGGCTCCAGCGACTTGACAAAGTTTTGCCTGCCC 
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SEQIDNO302ATAAACTCTCAAGTCCTGGCGGCCAGAAAACCAGCTTTCAGGCGGGGCGGNG 
GGGCTCCCCCTCCCCTTGCTTCGTCTCTGC 

SEQIDNO303 

GT AGGAGT CGNG GATGAGGAN AGAAGN GT CCT GAGN AATNGAGGGAGAN GGTGGANGAT 
SEQIDNO304 

NCCCANTGNTTTGACNCNGTGGTGNGAGGGGTNTTAANATGATTNAGTGCTATTNGCTAGAG • 
TGGNTATAAGNCTTGGA 

SEQIDNO305 

CNCGATNGTAAACGCCCCGCANCGGNTATGGNTAAAAAGNAGACCCTCAACAAAATNANGGA 
ATTGANACNTANCNAAA 

SEQIDNO306 

ACNANTATNNGAAGGTAGAGNGTNTGATGGGNGAAAAACGAATNGGGACNGGGGGTGCNTAA 
ACNNNAGTCAGNTNGAAGAANATAGA 

SEQIDNO307 

GNTNNATNAGCACTCTGTTGTGAGGTAAGGGNCTGGTGCCCCTCGGGATGTANTTCANTATN 
GCCGGAGAT 

SEQIDNO308 

NTTTNGGGTGACAAGTCTTATGTCTCAGGAATAGCGCCATTCATNGGTCGCAAAAAGCTTGA 
AGAACAAACTGNTCTGATTTTGTTCAACTTTTTCTTCTAT 

SEQIDNO309 

CACCTNTCAACAGCATCCAGCNACTCTAANCGCNAGAAAAACANCCGNGCCTNCATTGAAAC 
CNCCATTTTGCTTTTGNTGNTCGAAGCNCTNNTCNNCAGATCNCGATNCTGAAAN 

SEQIDNO310 

CCAAAGTNTCCGGCTCCANAGGGTTAGCAAGNGGGANGATGGCGTNGGGNNAGCGAGAATGA 
AAGCCTTCATNATCCCANGNAGAGAACA 

SEQIDN0311 

CCCATTTTCANCNACCNAANGCAGCCTAGGTTANAACCTCTNNNNNCTGNACAAGCANCAGG 
CTTTAAAGNTGNATGANTGAGGTCGANNGCGANCNTCTCAGNTNTNCCAGTATCCTCGCGCC 

TGAACCTA 
SEQIDN0312 

ANCCTGCNTGTTGTAACCGCCTGGGNTACTAATTGTATNANCTCTGCTATAAATTTTTTTAT 
SEQIDN0313 

NNNNCTNNNNNTGGGGANTAGAACCATTTTGTTCANTTCACTTTAGNNTTTGTNATGNAATG 
AAATAATAGCTATATCCNTNNNNTGAANNAAATGATGGCTGNTGCTGNGGGG 

SEQIDN0314 

CTGTTTTGGGNGNCAAGGATNNNGNCTGAGGNNNAGCGCCNNTCNTTGTTNCGCNANNAGNT 
TGCAGAACAAACTGNTGCTGATTATGCANAACNTTGCCTNCTG 
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SEQIDN0315 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0316 

NCNTNAGATGNNTAAAATGGTGNGNTGCTTNGGCTCTAANGAAG'NNGGGGNACT 
SEQIDN0317 

NTCGTNNNNNNNCTGTGTACTGNNATATGTGTCTGNATTACTCCTGNTGTAATGCATTGACT 
TATACGGGNCTTGGG 

SEQIDN0318 

TNGNTANGCCCCTATTNGTTACAGGATCNCTACTTTCCCACANAANATCGNCCATNGC 
SEQIDN0319 

TTNTAANACCCCATNNTGCATCTCACATAATGGACCGGCCANCAATANGTGAATTAGCTGGA 
TGATATTCAAACGAAAATTCATCATCTCC 

SEQIDNO320 

TTGNAAGCCCTAGTTNTANCCCAGCAGGGGCTGCTCCTGAAGGGCAATTTTACCCACCTTAT 
TATCCACCCTATGGGTATACGCCACCACCTCTACCATATCAACANTATTATCCTCJ\ACCTTA 
TCAAGCTACAACCCCACTCCACCTGGTGGTCAGCAAGCCACACATCAGCAGCAGCGGCACAA 
CAACC 

SEQIDN0321 

TTTGTAGCCCTAGTTGNTCCCAGGNGGGGCTGCTCGTGAAGGGNAANTNTACCCACCTTATT 

ATCCACCCTATGGGNNTNCGCCACCACCTNTACCATATCAACAGTATNATCCTCAACCTTAT 

CAAGCTACAANCCCACTCCACCTGGTGGNCAGNAAGCCACACATCAGCAGCAGCGGCACAAC 
AACC 

SEQIDN0322 

TGCCCAGGGAATGGGTATTGGGNGCAGTTGTACCGGGAACACTANATGACTATCAAAAATGN 
GCTTCACNGGACA 

SEQIDN0323 

GCTNNGGAATNGNANTGGAGCANNTGNACNNGGACACTACATGACTATCAAAAATGGGCGTC 
NTCACGACA 

SEQIDN0324 

TGAAACTATGTGCAAGAATTAGTCAGTTGACAATAATTTGATTGAGTCTTTCAATTCTTAGC 
ATTTTGGAAGCTAGATACAAGCCTATGA 

SEQIDN032 5 

ATCNTGATNNTCGGCCATCTGGTACNTGGAANNGGCGCTGGTGAGACTTGANTCTNGNCAGA 
GGNGGACCCCNAGCCACGAGCAGGATGCTGCATTANCATTGCNATCAGCAGTATAGGAATTC 
TCTTGCTCTGGCCAGATCGAATTTGAGGGCCATGGCATCAAGAGCCA 

SEQIDN0326 

GTTAATACCCGGATGTGGAACAGGAACTTCAGTCTGTNNGATAAGAATTACCTCTCCAGCAT 
CCAGGCTCAGCAGACTCTGCATCAGATTTCTTCACAATTCAATGGTGCT 
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SEQIDN0327 

TTTTTNTAGCTNCTAANAGCCCAAATTTCTCCGAGNCCAAAACAAGGTCAAGGTCCAAACAG 
TGAATTGGCCTTGGAGCAGGGCGTGAAAGACTCTGATATAGATGCTGCAAAAGTTGCTGCAT 

TGAAGGCTGCTGAACTAG 
SEQIDN0328 

TNNGCNGATNNTAAANTCCCCTCTTCGACGACNACNGCTNAGCATGCNTNTGTCTGANGAGT 
NCTAAAGGCTGTTNCCAAATTTACTAGNTCTTGACATGCGTATCTAACTGGANTGATTGGTA 
GANTATAAAANTGNGACAANNNGTNTGACTNG 

SEQIDN0329 

TTTAAAACCCTTNTAAAAACCGAAAAATGCTTNTAAAAGGGTCCAAGGCAGAGACCAAGAAA 
ANTAACTGTTGAAGANCGGANAGATGGAAGNAANGTANAATTTTGTNNAAGGATATGGTNAN 
GAT T GT T T T TN AAG AGAN GN C GN AAAACN AAC CC C AAAAT T C C T C C AG 

SEQIDNO330 

TTTTAGCGGCTNCTAAAGCNCGGACTAAGAGACCNTCNGCAAATGGCNAGGNTTGCNAGGTA 
ANNGCNTGNCNNNCGCNANTCNNAGTGCNCCCTTCNATNTTAGTACTNTTNCGNATTNTTAG 
ACTATNNANGGNGANAGTAGTACNGACCGGAANANGAGGCTCGAGACTTGTGACACCAGANC 
ANANTGNGCTACNCCCCCGCTAGGTATTGTACNCTTCCNNATGAACNTNNCGNTGC 

SEQIDN0331 

TNCNNNNNNNCTNCCGAGCNGNTNTCTCTGACTTAGGTNTATATTCTAGGAACTCTTCAGTG 
GGAAATGCCGTTNAAATTATGATACTAACTGTTAAGGTAGGAAAGATTACTGGTTGACACAG 

CATA 

SEQIDN0332 

GNNNNTGTNNNNGGNGTGNNCGCATNGGGTGAGTGGAGTTCACNAGNNTGGGNNACTGAAAT 
TTATAGAGACGCTANTGAGGGGGCGGAGNGGCCNNNNTCNNATTCNGACNTTCTNGTGCCNN 

ATNACNATTAGAAGNA 
SEQIDN0333 

CCCNNNGNNNNCGNGGAATTGCGANTTGNAAAGCAACNTGTTGTCATGNAGAGCAGGAAACA 
AAATNTCGTATCTCGATCTAGANCNTNAGCACANTACAGANNTATGNNACAGGCTGTGNGNG 
AGGTANTCANNTATCGGTTTGTA 

SEQIDN0334 

GNNNNNCNNNNGCNGNTCTGTGGTCTTGNCNTTGGANATTAAGCNCCTACTTNNTACGNTAC 
TGNANNAGNCNGCNTCTANGAGCAAGCNACNAGCCCTACTACTANATTNANCTACTGCCTTT 
ATGTNTAACAAAGNNNGAGCAAGANAGGACCAACAGATGCTACTAGCTAGAGTTGATCATA 

SEQIDN0335 

TAAANGNNNNGNAGCAAGGAAGCTCTAGCTTGAAGGATGCTGATTATNANTTTTGATTAGAA 
TTTTACAAATGTAAAGAATTATACTAATGTAAAGAACTACGTTTGGGCTTGATCCCCATAGG 
AGCTTAGCCCGGGGTACGTAGGCAACCTGTGAGAAAAGGAGAGATCAGGTGCAGCCCCTTGT 

A 

SEQIDN0336 



FIGURE 4 (continued) 



BNSDOCID: <WO. 



030851 15A2_I_> 



WO 03/085115 



98/140 



PCT/EP03/03703 



ACNCGAATNGNAAAGGAACCCGAAACTATGANTNNNAAACTNGNAATTCTTTGATGCTACAA 
ATTGGCACTGNATNG 

SEQIDN0337 

GNGTCCAATTNNGGTTTACGTGTTACTNTNGTTTTCCCTGCTCATACTAAGCTGTGAAGATG 
ATTTAGTGCTATTTGAGTAGCAGTGGTTGTAAGCCTTGGGA 

SEQIDN0338 

GTTGTGTNCCATTCNGCATGCTNTATTACATGNGTTGTATGAGGTGNNACTGATCAGGAACA 
CTANATGACTATCAAAAATGTGCTTTACGNCA 

SEQIDN0339 

CATTGTCTTCTTTTTNTTTCTTCTTTTGGCGAATTTTCTTTTGNTTTCTTGA 
SEQIDNO340 

CTGGGTGTAAGTNGAAGAAGGATAATGGACAAGTGATCCAAAGCATTATAGGGACGACACTT 
TAGGCA 

SEQIDN0341 

NGGCCGGAGTGGGTGNGGNGANGANTGGATCGTTGGTGAGTGGTGNGTNNNC 
SEQIDN0342 

GCCGCATACATGCATATCCGNGGGGCAGCAGGATGCGGGAACAGTTTTTTNATGGGNACCCC 
TANTGCANGNNCN 

SEQIDN0343 

AATCCNNGTNTAAGATTNTCAGCNTTGGGCNAGAGNAAGCNCTAATCNTGATNANCANTGGT 
GAACC1SIAANTANCCAGTTACCACCT 

SEQIDN0344 

GCTCTTCTGTAAANGGTTATTTTTTGACTGACANNCAAGGGGGTAAATTTTTANTTANNACC 
ANAANTTGNTTAAGGNNN 

SEQIDN0345 

ACGTACACATTCTCCTCAATTGCTCAGGAAATGGTATTGGGTGCAGTTGTACNGGNAACACT 
AC AT G AC TAT C AAAAAT GT GC T T C AC G AC A 

SEQIDN034 6 

GGTGCGATCGNCTGCCGAAGAAGCGTTGTACTTGNAAAATATCGGAGGAAATATCCCTGAAA 
TAACTGCCAACGCTGGTGCAGNCAAAAGGTACTATGTTCGNTCTTNNATNTAGCA 

SEQIDN0347 

NCNGTTATAGTCGANACACANGGNATGCCCTCTNGNAAACATNTATTGTACNGGATGACGTA 
TTCTGATANTNNCTTCAAANAAAGANNCATCACTAGNGAGCACGAAAGATAAGTGTNTTNTC 
TCAAAGAAATGACCA 

SEQIDN0348 

TNGCCGNTTNCCATGNNGNACNTGGATANTCNAANNCTNTCCGNNNGNGCTCGNGNNTANNG 
NCCGGCNANACACCANNCCNACTNTNTGTGACGCNTGNAGGACNANCTATGNTGGNAGGANT 
TNATAGNNNGNNCCANATCNGCNCTNGACAGNCACTNNCCTGNGACTNCNNTGNANC 
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SEQIDN0349 

TAGNCCGCTNGTTCAAGAGATTNNGCTCTGGCATCTGTAAGTGAGATATCAAAGCGCACTTC 
T G AAAC C C C T C AAC G AG AAAAT AG AAG G AAT N C AAC AAAG AT T G AC C AAC CAT T C T GT AG AA 
GCAGAACAGAAAAGGGTGAATTGCTATCACACTCAGGAAATTTTGAATCAATAAACGAGAAT 
GGAAACAGAACATGTTCCCGTACTGNATTTTNTCCTTTCAGC 

SEQIDNO350 

CCCAAANTCCNTCTTNTACGATTACTCAGGAACNNATNATGNGATTGNNCTNGACCGAANGC 
CTTNTNCGTGATTACCTGGAAAAGCTGCAGCTGGACA 

SEQIDN0351 

TACTATGTTATTGTTCGTCANGANANTNTGCAACNGNTGNCCCA 
SEQIDN0352 

GCGAGGGCCTCCCAAGNTGAGTNTGNAGCNNGGNGTNANGNAATNAAGAGNAGAAAGAGGNT 
CANGCGGNNGAAAATGTAA 

SEQIDN0353 

GCGCGGGACCCTACCGAANGGGTAATTTGNAGCAGNCTCGTACAAAANATAGGAGGAGTANA 
ANGTAAGNTCNNGCGGAAGANNATGTAA 

SEQIDN0354 

TGTGNTTCTTCCTGTTATGGGGACTTGTTGGTTATTTCCTTTTTTGTGAAGCTCTGGTCGTT 
AC C T C AAAGT GT AT GT AC T T C C AAACGGAA 

SEQIDN0355 

ATTGTCTTCTCTTTTGGTTTCTTTTCTTTTGGCGAATTTTCTTTTGTTTTCTGCTTGA 
SEQIDN0356 

AAGCACTCTGTTGTGAGGTAAGGCCTAGTCCCCCTCTCTATGTACTTNATTTNTGCCATACA 
TTT 

SEQIDN0357 

GGATTCAANCCATCGAGGGTCCATNGTGGTCTCCGGCTTACGGNCTATTNGTGNTCAACTAT 
TNGGTGGNCCGCATNNTTCTTGTANACTANCGGGAANATCT 

SEQIDN0358 

TGCTCAGNTNGATCNAAGGGGNGTNTTTTNACATGGAACAGGGCAACTGCCTCTACTTGNTT 
TNATGCCTTTTTCATTNNGTNCATTTCTAGGGATCGGCCGT 

SEQIDN0359 

GATNNNNTANCCNNGGNCTATNAACGTTNCCGANGCAGGTNCGCNATGCTNTGNCCTTATNN 
CATNGCGAANGAGTACCNGGANANCCCNCNTGGACANACNTGAGGGCAGCCATGGGNAGGCT 
GANACAAAATTCTGGTTCACTAATTTCCATCTTTNCTTTTTNTTTATNNGCCAACACANTAA 

CTNTATTGGTACTAGAACATGGNATTACCTTTGGGT 
SEQIDNO360 

NACGCAGNNNAAANACGATGACGAAAGNCCGCCAAAACCACTGACTTGACACNTNNAAGATT 
GCTNGGGANCANAGGANGCN 
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SEQIDN0361 

TGTNNAAANAAGGCGTGCCGAGGCNGACGGATGTGNCANGTGTCNCANGACGATGTTACTGA 
ATNGGTANTTACANCGGGAATCTGTGGCGNTCATGC 

SEQIDN0362 

AAGNCGGAANGTTTGTANCCCGNACCNCAKlANAAATTCACATTG 
SEQIDN0363 

GGAGCACAGCAATTCNAAATTCTTTCTACCATTTTGGTTTCATATCTAAGTCATTCCCTATT 
GGGCTTGCGCT 

SEQIDN0364 

GTGGGATGCTGACNNTGNAGCTNTTNGTNTNGTNCCNNAGNNATTNCNNGCNATTAAGCAT 
SEQIDN0365 

CGCTANNNTAGCANTCCGATGTGAGGGANGNNNCNAGNCCCCCTCTTTATGAACTTGANTGC 
TGGCATACA 

SEQIDN0366 

GNAAAGCTAANGTGANNATTAGCACTCTGNTGGGAGGGTANNNNCTANANTCCCCTAANTAT 
GNTACTTAATTGGGGCCGTNCAT 

SEQIDN0367 

GCCGGCTNNTGNAGNGNCGNTGCTTNNTTNAGTNTNNTGAGCATGGNCCTNNAG7\AAACGCT 
NGTGGCATGATGCNTNANGGGGN 

SEQIDN0368 

NNNTATCCCTGCTGTGAGGAGTGTTNTTCCTTGTGTNATGCCTNTATTTGNGTTTCCGCNNT 
TGTGCTCTTNTCNTAATGTATAGATTNTNACTGTAGATTCTCAT 

SEQIDN0369 

GNCNGGTNGNNGAACTAAAGTAAGTNGGTAGGCATGGTGGCGAATGAACCTAAAAAGTAAAA 
T C T AAC T T GC AG G AT C AAAC AT AN GN T C A 

SEQIDNO370 

CNCATTGTANATCAACCTATATGATGGACTTACGNGAAGTTTCCAAGACACATGACTAAAGC 
TGACCAAGTCTANTAGGCTAGNTCAAGCCCGTACCGTGACA 

SEQIDN0371 

TCGTATTTATGCNCATGAATGATGTGCAGTGNTGTGTCCTGACTNATNGGAGCCGTTGTCAA 
ACATGNNGTATGAGTAGGAAGNATTNNCTGCTCNTCTCGGNCATGNAGGNAGCCANATNNGT 
CNGNNAGTGCAGAT 

SEQIDN0372 

NCGATNCNNANGAClNlCANNNNNGCGAGGTGNGTAANANTTTGlSINACCTTTANTNGCTGCACT 
ANGANATCGACNNGCNCNGTGANNGNNNNACNTGAGGAAANCANAGCNGGAATGNCTNAGTA 

SEQIDN0373 
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AATATGGAACTGGAATTATGTATCTGTATTACTCCTGTTGTAATGCATTGACTTATACGGCC 
TTGG 

SEQIDN037 4 

GCTTATAGTGCTGNATTTATGCTGATAAATTCTGTAACATAATAGTGAGGTTGTAATGTAGA 
TGTTGAAGAGCTACCTG 



SEQIDN037 5 

GNTNAAGCAGNGTNGNTAANAGGNNGCATTTTCTAGTTTCAGATTTTTCTGTTCTTGGAGCA 
ATAACATCCATCTTTCTCCT 

SEQIDN037 6 

CCGTTNCCCTCAAACACCCTTGAATCCTATCGAATCTGGATTTGAAGACGAACCCTAGAAAT 
TCCAAAATCCTAAATCGAGTGTTCGTTGAATTTTTCCAGTCTAAATTGATTTTATTCGTGTG 

TTCTTG 

SEQIDN0377 . 

TTTGNGATANNTTTAGTTGGATGGNATGGAATGCTTATCTNNTATNCGAAANGATGGT 
SEQIDN0378 

TTGAATAACNCCAGNATNGGCNNAATACANNCCCTAATANCGAATGATCTGGTATTTTACAG 
GNCTGACGGGGGGNCGCCCTTTTCCGTGN 

SEQIDN037 9 

TGCTTGTANNANGCCNATGCTGTNTGGTGGNNCGCGCACGTNGTGNTCNNNTGAGAGGACAT 
NTCTGANTTGNGCCAGGNNCCNGANGAAGACTNCCGATANTTANTGCCGAGGCNCATGGGGG 

SEQIDNO380 

TNTAGACCCGTTTTATACAAAGCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCTGCTTG 
ACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAGGTGGA 

G 

SEQIDN0381 

NGCACGGCCCTCGGNCTTGCAAAAANGTGGNNACACCCTCGGGGNCNNNGCCAGNGGG 
SEQIDN0382 

ACTCNANNCCCGCGTGCTCGCGCCAGCTCCCAATGCAAATGGNATAGAAAAATNCAATGCTG 
AGCATCG 

SEQIDN0383 

NNCTNNAATGTAGCTAGTACAAGTGGNAGTGNGCTACACAATATAGCTTGACCCCGACAAAA 
ATNCTNCACGCACTAGNAACTCATGACATGGTATACG 

SEQIDN0384 

CGNAGCNCGNNCGNACACNNCGACAAAGGGANCGNCACANCC 
SEQIDN0385 
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GCCCCCTGTNGCTGCTCCCTNAGTGNTNGGNCATNCAGTGGTAAGCATATTGGCCTGCGCCA 
GCATACTCTAANCATGGTNTGNGATAGAATTCCATCACGCTACTCTNGNGGCNCATGAAGAG 
CATATCCG 

SEQIDN0386 

NTGTAGCTTTCTNTGTAAGCTTATGTACCTANNNGNNCCTGCACCGCCCATGGCTGCCGGAT 
CTGATAGCTCCCAAACNATTNGTTTCAACCACAACCCAATTCTTGCCCCAAAACCAACCACA 
TCGTAGCCCACCAGNTNTGTTCTTCTCTCCG 

SEQIDN0387 

NATNATCTCCGTGAGAAAAGACNCTAATGANTATNGNTTAANCTTATGCCCTATACTCATTC 
GACGACTNACACTGNAATAAAGCCGAGTAATNGCAAATGCATTTATTTATACTACACC 

SEQIDN0388 

TTGANTACNNTNNANTNCNGNCCTTCCNTNCAAACAACAGNACNNTGAGAAGCCATAAAAAT 
ACAGCTAG 

SEQIDN0389 

GTGTNTCCTTGTGTNATGCCTNTNTTNTTGTTTCCGCTATTGTACTCTCATCATAATGNNTA 
CCATTTTTCTGNAGATTCTNA 

SEQIDNO390 

CCTCACGTGGTCTGGGACAGGGNACCNCGCTGGGCTGGGGCATNTNANGGCTCATATCGTGG 
CAGAGGACATGGCACTACACGAGGTGGTCGCGGTCGTGGAAGTAGCAGTTTGGGGCCGTGTC 
AG 

SEQIDN0391 

GGCCTGGTCNGTGTACTTANACAAAGTCCCAAGGACTGAGACTNTGTACAGTTGCGGAATCT 
GCTTGACCCCTTTTACATGGTTGATACTTGTAACCAAACAGAACATGCTGAAGGTGCAAAAG 
GTGGAG 

SEQIDN0392 

CTTAGCANCACAGCTGCTTANCACAAAGATACCAGCCCAGGGAAGTTGAATTTGNNTGTCTA 
CAGCNAAAGCCATTGCNGANGNAAAGCCCCTNGTTN 

SEQIDN0393 

TACAAAACGTNTTCATTCTTTCNANTAAATCTTNTATTNTTATNAGAGACATGGGTNGCCCG 
TTNGANGGAGTACTGNTGTTCTTCCTCNNGNTNAGTTGCNGAATATTGCANTNGCTGC 

SEQIDN0394 

GCTCTACAGAGGACAAGNACTNATATCTGNAGACAAGAGGGAATTGCAGCACTCANGATGTG 
GTAGAACGGACAAGGGAGTTTCCTCTNNTGNTCAAGTGATNTCTCTCTTC 

SEQIDN0395 

CCTTNGNTAGGCCGNCGACCTTCAGGANAACCTCNNTNCNGGAGACCGTNNCTNTCGNCNTG 
NTGATGGCCATNNNTTNAAACGNNTTGTGATG 

SEQIDN0396 

TGTAGTGAGGAGANTGAGGCTGCAGATGAGGTGGCTGGTAAAATCTGTGATGAATTTGATTC 
AACGGTAGTGAATAGTCATGTCAAAAGACTACCACTTGCTGATGTAACTGATTCATATCTGA 
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ATCTTCCTGCTTCAATCTCTGCAGCTGAGAGGTCTCATGCTAGGGGAAGTCTGGATTCTGTC 
AAGACAGATGCTAGCTGCACTGGGCATCATAATAAAGCCAAAAGAAAGCTTGGAAGTAGC 

SEQIDN0397 

GCAATCTNAACTCCCGACTTNGNTGNGTNCTGATNTCTGCTGTTGAATCGGCTGTTTGGTGG 
CTGAACTCAGACCATCATTTTGGTCCATTTCTTTGATGTTGTTCTGCTTGTAGTTGTCCTGA 
AGTATTTATGGAAGTTGATTCAAGTCTAATAGTGGCCTTTACTCTGCATTTTAGCTGTCCTG 
AAGAATTTATGGAAGTTGATTC7\AGTCAAATAGTGGCCTTTACTCTGCATTTTAGGTACGTA 
CAGGTCAACTGT7\ATTCTCTGTTGCATTTCTAAATGAAAATATGGGTTATCTTGTCATGTTT 
NGNG 

SEQIDN0398 

TTTTANGCAAGNNTNNCCTCCCANGAACAANCCCTTAGTCCAGNTTCAAAG 
SEQIDN0399 

TTTTANGAGNAACTAAATCCCCTTNTNCCGANCCCNTGCAAAANGNGGNCTANACNGNNNNN 
NTGAGNGNNNAATNCNAANATNAAACNCTGCNTTCATTCTTTNCCTACTGATATGAGACTGT 
CAATNCTGNCAGGGCAC 

SEQIDNO400 

TTNTTNGGCTCGTCAGGGNGATTCTTCCTGCNTATGCTGATNATGAGTTGACCGATGTTCAN 
TGTTNNNTAGANCTGNCCNAGTCCNGGCAATGTNNCAAGTATATAGTGGCACTGCNCGGTNT 
TATGNCAACATCAATNCTGCGAAAAGCTTCACC 

SEQIDNO401 

NNNCACTNCTAAAGCNCTCTCCTAANGACCCCCAAGAGGANGCNTNTACTAGACATNC1SIACT 
CAGGCGNGATCCGCANNCCTGANCCGCGTATAGCTGGTATGATNGGNCANCC7VAGGATTNTG 
GNNTACGAGGGCCGTTANGTGNGANANGCACAATGNNGGACAANANNTGNACCTNANGNGNN 
ACAACNCAACCCAAAGGCTAACTATGCGAACCAGACACACCTACTAACGCTCTACTATGTGN 
CACAAGCTGTGCGGTACGACAAGGC 

SEQIDNO402 

NGGCGTGGTGGCTGNAANGGGTCTNANGNTGCC 
SEQIDNO403 

TTCTCAGGNAGGCGGGGGTGNCATCNCTGAACACCANAGGCAGNTNNCC 
SEQIDNO404 

AACCTCTTTTCTAGNAACCACTCTCTNAATNTGTGGTNGGCGNTTNCA 
SEQIDNO405 

ACCNCNANNCNTCNGAGGGANANGCCNACNTNNTGGCNGTGGGCCCGGAANTGTNCNAATAT 
AA 

SEQIDNO4 0 6 

TGCCGGGGTTNTCNNACAAG7\ATGCCNNNCNCTGNNNCGTGT1SITGTCTGNNCNCATATGCNG 
GANANGNNCNTGNCCNAAANNNGNCATNGTGCCTTNCAGTAGNATNANCNGATCANCTNTNA 
GAGTNNCCNNNNCAGGNNNNCNNCAGNTNGNTAGTGTNTNTGCTNTNGATNTGACCTTACTA 
TAAANATGAANCGGCACNACCATAAGGTATAAATGTAGGCACANTGCTTGCTCTATA 
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SEQIDNO407 

CTCGGTGAATGCACCATCCTCANTTCAAAGTGGTTGCTATGGTNTANCAGACANCATATCGG 
TNACANNNTNCGAATTGAACGAAGAATTTGGNGGTAAACTNTGTCAGCAGAGCATGAATGCT 
GGTTTGTCTAGTGGAGTTGAGGTTATTGATGTTTNTACTCCTCCATGCTACAAGGTAAGTGG 
AGACAGCAAGAAAAGAAGACTTTCTACGGCTTGTCTTGAAATTATTGATTTGACAGACTCAC 
CTATTTTTGTCTGATGTAACTAATA 

SEQIDNO408 

TGANAGAATGGGTTCTANTNAGGAACNATGTNTTGTA 
SEQIDNO409 

ACTCNCAGTTGNGNGGTGCGNAGTAAACAACTAACAAGANTGCGNAAGCATTCANGAGGACC 
CACTGTANGCTTATNNCATCTNGATCAAAATCAGAATGAAGTTATTTCTACTCTTG 

SEQIDNO410 

TGGNAGCGCCGCGTAGCGANAGGNACTATAGCCTGGGGTNGTATAGACACNTATNGGCTGGC 
ACANCTTCTNACA 

SEQIDN0411 

TNNCATTGAATNGCCCTACATNTACCAATNTGNAATCNACTGATACTTCTCA7\AACATATCA 
NTGNCTTGCCCACTTCATTACGGGNTTGTATGANAANCCA 

SEQIDN0412 

TTCCCGGCCTGGTTNCCCTACTNATACTCNACCATACCCNAGAAACCCNTAACCTAATTCTT 
CATTNNCTCTCCNCATATCATCNTCAAATACTCTNTNCACANATTCGTTCCTTCTACAACTC 
CATCACTTTNTCCCTCTCGCCACCGTTCCAAGTATTGCACATGGGTGANAGCTGNTTNATGN 
TCTGTNGCTGNGACAGATGAACAACACCATATCGCNAGTAATGGACTAGTACACAAAGAATA 
TGCTGNCC 

SEQIDN0413 

CTNANTGGCGNNATCAGTGCTCACA 
SEQIDN0414 

GTTCTTTTNGCAACTTTGATCGGGAAAGGGCTCNCA 
SEQIDN0415 

NTACTTCTGTTTTCTTTTTGTGTCAAATATTGTTTGAACTCTGGGTTTTCTACCACGTGCCA 
CGGTACCACTGA 

SEQIDN0416 

TGCAATAATGAACAAAGCAAGATATCAGTAGTGATATCTTTGTTTTAGAGCATCTTTGTTTA 
GCTGCTNTCCACTANCTACAAAATTGAATATTGCAACATTTGTAACCTTATTTTTATCTTGG 
CAA 

SEQIDN0417. 

TCCCTTGTTTNATGGAGCCGATTACTTTATGAGAATGCTCAG7^AACTTCAAGCAATCGAGAC 
AGATAATCGCAGGCAACGAGCAGCTCTGGTGACCTTACAGGNNAAGGTAGATGCTGTTGCTT 
ACCCAAGAGGAACTCTGGGTGAAAAATACGTGCATACTTCCA 

SEQIDN0418 
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ACCCCNNTNTA/^AGGGGCCAAAGGNANAANCTGCAATCATTATTCGATTGAAACAATCCTGC 
GATNNANACNNGANANNCTGANANATGNCTNAANNNAAANATTTGTGCTGANNGGGGTGCTN 
. TTCNNCATGAGGANTANATNNTNNCANCNNCTNAAGCTTCTTTCCATACTGGA 

SEQIDN0419 

TTGCGTGGCAGTTNGGGGCANAGGCACTGGAGACAAGGGCNACTCCAA 
SEQIDNO420 

AAGCAACCTTGAATCAGACTCCTCACTGATCTCTCCTTCTCGTCACTGTTTCTGTGTGTGTG 
TGTGTGTGTGT 

SEQIDN0421 

GAATATGGAAGATTCCGAAAAAGTGTCAATAGATGGCAAAAATCACAATGGGCATGCAAAAT 
ATAGTTTCAAGAACACAAATCGGAGGAAGATGTTTGGTCACCCTGT^AAAATTTAGTTCAGTG 
GAAACTGCGATGTCTAGAATAAAGAATAAGAGTCATAGACCAGCTGATAGTGATGGAGAGGG 
TGGAATGT 

SEQIDN0422 

TNAAANANAATGNATTCCNCTNGGGT 
SEQIDN0423 

GAATTCTTTGGTGTNCATGCGAATTACGCGTTCAGTTCTTATTGGGCTCACGT 
SEQIDN0424 

AGGATACAAAANCGAANCCNNTGNGTGNCTACACTGCNGAACTGCGTCGTTGCAGGGTCTTA 
TTGGGCTCAGT 

SEQIDN0425 

NAACAATTTGAAATAATATATTTCGTCAATGCAGCTTGCAAGCTGCAGAGAGGAGAGTCATT 
ATAGTAACTTTATAACTTTTGTTTCAGTTTACAAACCTTGTAAATTTTGACCATATTGAAGT 
TCTCCCTTCAG 

SEQIDN0426 

GCACATTNTCACATCTTTACTAANATAAGAAGATTNCTGTANCATCTACTAAGATATTGCAN 
AATNNTATCAGCNAGAGTGTTGACGCCGC 

SEQIDN0427 

GAGACTTTCAATTGCGTGCNTGCTNTNANCAAGCCGCGAGACANTNCTAATACTNNGACNNG 
CTGGNAATGNGNCATCTNGNNNNNCTANTNAGANNCNNANGCNCACAANGTNNACTGTGTCC 
TTCTGGCTGATGNCTTCCCNAGCATTACGTGNTGTCTGCGGCCTGAATAAGATACTGCCTCT 
GCAAATCC 

SEQIDN0428 

CCTGCTNATTGGANGGAGCACTGAGGGTGGTACTNNTTGCAGGAAAATGCCTGTCNTNNGNA- 

CNCAANTNCANGCCCGNNNNGCACGANGTNGATGCGGNACNANNGCNGCNTNATATCTGNNN 

NGNATCGNNANGTGTNACACGCNANNGANAGCACCGGNTANNTNTTNNATCCTNTGCCGGTG 

TACCTTTGANNTNANANTCCTCNTGTTACCNGANGNCANGTGCTTCTNCTNAGCTTGNTANT 

TGAANTGGNGTGAGAATGAATGACCAGCNGCT 

SEQIDN0429 
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NAGNTNTAATTACTCGGNC 
SEQIDNO430 

TTTTANNCAGTATAANTNCCTNCCCCTTAAACCCCCCACTGGAC 
SEQIDNQ431 

TGGNACTTTCTNCTCTTCAAAAGCTTTGACTCTCT 
SEQIDN0432 

TGTGCGCNNNGTGNATGTATATGTGGTCTNGGCTCTNAGNCTGNCT 
SEQIDN0433 

TGCAGCTTGGGGAAGACCAGGATATGAGCGNCGGAGTGAGCCACTCCATAT 
SEQIDN0434 

GACAACACCATCAGGTACAATGGCCAAGTCGCAGGCACTGGGAGACGTACCAAGTTAGGAT 
SEQIDN0435 

TGAACTAGANATGTCATTCTATAGCNAGTATTCAGCCNGTGCTGTGTNTTANCATAATATNA 
AGAATNTTTCTNACTTACGTGCAGGGGAT 

SEQIDN0436 

NATAGAGGAGGACCCATCTGACTCCCGTCTTCTTCTTCATTAGAAATGGGAATCAACATCCA 
C G AAC AAAAAT GCTATCGC TAT 

SEQIDN04 37 

TCAGCCTCCCGGCTTTAACCTACTGNGGGNACAGNATGTNGGAAATNCCNGCNAAGCTGGNT 
GGNT 

SEQIDN0438 

GGNNATGCNCATTGGAAACACNCGAATGAAACGTTTCTNTGCGAAAGTACTCACCAACGAGT 
GCCATTGGAAAGATTTCTATATTGTTATGGAACGCCTAGANNNCAATACAGTGNNACGCAGC 
ATCT 

SEQIDN0439 

CATCTCGCAATGTNATCCAGNGTNAGCTAACNG 
SEQIDNO440 

TATAATCCNGCACTCNCAGGANCGCAAATAGNTGTGNNTGATGGTTATT1SITNGTTATG 
SEQIDN04 41 

ACCNAAGATCCCCCNNTNAAACACCCAATCCCCCCT1SITCCGGCAATGAAGCTGCCGGAGCTG 
. ACATTGATCTGGCCGATGTTTTCGCCAAGTACTTGAACCAAGGTACAACAAATGATAATGAT 
CATGATCAAGATAATATTCTTCAAGAATCTCCCTTGGCTGATCAAGATTATTGTTCTATTGG 
AGCAAGCTTATCAAATTCTCCTTCATTAGATAGCTTGG 

SEQIDN0442 

ACCANAACNCAANNGAAAGGGCCCCTACTTATAGNNNCCAAGGAGGAGNACAAGTTACTGAT 
TGG 
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SEQIDNO'4 4 3 

CTATTAAATACCTCCGGGTTTTAAANACCACNCGNCTATATTACCGGTTCCGAANCATTGTG 
CNG 

SEQIDN0444 

NCNCGGAAAGGCCCCCCTTNGTGGGGNAAACGACCCGGACTCTCNGGCNGCCC 
SEQIDN0445 

GTAAGGGTAAGGTCTTCGCTACAACTACAGTCGTTTGGTGGGTAACAACCATC7y\TAACATT 
ATCATCCTTCTCAATCTTAGCCG 

SEQIDN0446 

TCTNTGGNAAAGCCC1SITGAGANATTGGGAAAACTNAACAAACNGNTAAGCAGCAGGAGANCC 
NACANGNNNAGNGAGGCCATTTTTTTNCGACANCNGNGATAACAAAAGGAAGCAGGNGGCAA 
ATTCGAGCTCAGACACNGAAAACCAGNNTCTNA 

SEQIDN04 4 7 

ACTGGCNTNTGCNAGCGTTAGGTTGCTGGTTGTCCTTTNCTTTTNCACTATNNTTTTTGNGC 
TGTNNNTCTTCACCGTTTAGGGANCATTACCCAGTTNCAAAANCAGCTCNGTNACATCCGNC 
G 

SEQIDN0448 

CTCGGCATAATCGTGCGTTATATCGCTGGTAGTCCGAAACATTCACAAGATTATTTTTCTGC 
TGATGCTCGGCATATTATATGGATTCTTTATTCATCGATATTGGCACTTGATATTTTCTGAG 
TCG 

SEQIDN0449 
NCCGCAGAGTCCCTGCAGC 

SEQIDNO450 

GGTNTTGGANCTCCATTCTCTATTAGCCNG 
SEQIDN0451 

AAGGTGANGTCNCAAAGANNTGACCGGGGCCTGNNTNTGNTCNGNNNACAGGCATANCNGNA 
G A C N G AAG C G AN G AN G AC T N AAG 

SEQIDN0452 

CANGTGCAAGANTGTTCNTCGAATATTTTTGTATTATATANGCAAATAGTAACCCCACACCT 
ACTAGTTGTTTCTAATTTTCATTTTCTTTTCATTTGTTACTGTTTCGATTTTTTTCTTACCA 
TGTTGGATAAATAATGTGTTGACTATAA 

SEQIDN0453 

TCNNTGAGCTNNNTTGCAGCTTCTAGCNGANCTTTNTTTGCAGCGTCTNGCAGNNGNTTTNT 
NNGCCATGNTTGTTNTTCCTNTNCATAGCCCNGTGTATTTTTGGCTATGANCCTGCTCTAGT 
NTNCATCTGCCTTCAGCGTGAGCCTNGTCAACTACATTNTTCTTGGAA 

SEQIDN0454 

NTGNGGAGCATGAGTTTATTGCGTTTGATGGTTCACATGCTAAGTCTGAATACATTTACACC 
GTTTTAGATAACCTAGTCGGTCAAANACAACACATTACTATTTTTCCAGATGCTGATTCTTT 
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AGTTCTTGAGAATAGCTGAAAGTAATCAGAGTTTAGATATGCTGAACTTCCAATACAGCCTT 
AG 

SEQIDN0455 

GATGGCAAAGCAACATTGNACAGGNTGAGGACTACTAGAATATTANANGCTNNTATTGGGTA 
GGNCATACGTTGGTNCTGTGAAAGGGAATCAATGCCNTGNTNTNNCTNGCNNGANNTNGAGC 
NTNNNGGNGCACAAATGNNCTATAANNAGCCCTNTNATGNAGGNGGAGNNCACAAGNGNAGG 
ANGTGATGCCNANCTGACCTAGCTTGTGTAACACAGGNTCATTGANAG 



SEQIDN0456 

CAAAGAGTGAGGAAAAATGGAAACTGATTGCGTGGTGCTACCGTTTCACACGGTATACATGA 

AAAGAAAT CAAGT CAGGT AT TT TG AC AGTG AGGATT CAT AAC AAACAAC AAGAT CAT AT ACT 
CTGAAGTAGCCGAATCCAGGGAGCTGTTGATCTGATCTCGATCCCCAGCAGCGTGCAGGTGA 
CTAACAAAGCTAAAAACCAACTCTATTCAAGAGCTGGAGGTGCTTCAACATAATAAGTAAGG 
GCTGTTCATTCTTGATTCTTTCAATTAG 

SEQIDN04 57 

CTTTGCCACATTCTCGGCGNCACTNGTAAGTAG 
SEQIDN0458 

TGNGCANANAACANAGGACTNAGGCAAGCGNTANTATGGGGANNGGANCCNANGNGGCNCNT 
CAAGTGNANTC 

SEQIDN0459 

CCCCGATGCCTTCAGTAGACAGAAGCTCACTGCTGTTGCACCAATNTNCACCCCGATGACTC 
T GCCAG AGGGCGAACTAGTT GC 

SEQIDNO4 60 

NCCGGAGAAAAGGTCGAAACCGACCGTAGNTAGGACTNAGTTTCTCTTNCNGAAAGANCNTG 
ATCGGGCTCTAGNNCANAACCNNGGNTTTNAATATATAATAGANAAACTTCTTNNGNANGTT 

ATG 

SEQIDN0461 

TGAGGANAAAGAAGGNTACNGCNCTTNCCGATGNACACNCAGNAGGATGANCNATNNNACNG 
ACTCTCNATGCTGNNCGATGNCCAGAAGGTGAGCAACTGGAAGANTTTCTTCTGTTTTTNGT 
NCTTACATATNTGAANANNAATCANNNAAGTANGANCACTAAAACNAACCCATANTGGTCCA 

TAANCTNTNNNCCTN 
SEQIDN0462 

GGGTAATTCAACAGTGTAGATTTTTTTCTAGCTTTTGTAGCAAATGAATTTTTTTGATCTGT 
TGTTGTACTGTATCCAAAAACAAAAATGTTGTTCAATGAAAGATGAAC 

SEQIDN0463 

TCAANGAAGCTCTCACCAGTCTCCATTAGTAGAGTCTATAATTATGC 
SEQIDN04 64 

TTTGANNNCCCAANGAGNANCNCGNTGAAAAAGGNCCTGATGAATTCACCACCAATGCCTCA 
CAATCTTTGTGGNGGACTAAANTGTTTTTGCCTTTTNTTGAAAAAGCCTTTGCTCAGCG 
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SEQIDN0465 

NGTTGNATVACATGCNGCCNTCNGGGTCTATCCAGGAATGCGATTCTGCCAGATGCGATTCCA 
CACGCTAGTCGGAAAAGTCGATAGNTATAAAAAGAAGGGCAACTATCAGGGTGAGCTCGCGG 
AAGGACCTGTTCCTTCTCGTTCCTGGAAAATGTTTGAAGACGAGAGCGTGC 

SEQIDN04 66 

GGCTGCCNTCAGTCCACCCGGAGACCCAAGGTAGACCTGCAGGCGTTCGCGGGGTCTGGCGT 
CTCCCTCTATCTCTATTACCTGTTTCATTTTCTTTCGTTCAAAAACAGTTTATTGTATTTTC 
TTCAGGCCTTGTTTGTAGTGACTCTTAGATAGTATGTGACACTATGACACCAGATTTTGGGT 
ATTGAGGTTTTGAAAGCTGTAATAGATATAGTCTTGAGTTATAAAATTTGTTGATTTCCGC 

SEQIDN0467 

ACATCTAAAGACGGCAAAGTTCAAGAGACTTCAGCTCTGGCATCTGTAAGTGGGATATCAAA 
GCGCATTTCTGAAACCCCTCAACGAGAAAATAGAAGGACTAGAACAAAGCTTGACCAACCAT 
T C T G T AG AAG C G AC A C AG AAAAG GG T G AAT T G C TAT C AC AC T C AG G AAAT T T C G AAT C AAT A 
AACGAGAATGGAAACAGAACATGTTCCCGTACTGTATTTTCTCCTTTCAGC 

SEQIDN04 68 

TANGCANTTTTTNATNGTCGCNTGTANAAGCCNCAANTCNGATCGGNNCCAACCTTCTGAG 
SEQIDN0469 

TGTANCTTCTTNNGCTCNTCNGNTGGNTGGGCAGTCTGNANTNATCAGCTGNCTTCC 
SEQIDNO470 

GAACAGNAGAANNGGAAGNATANGGAAGNCGAAGGAGTGAGCACAACGGCACCACCATGNCT 
CGN 

SEQIDN0471 

TTTTTGCTAGGGATGGTTGGACNNGTGANTTTTGNATGTGAGTGCNTCTATCNTTTAGCANT 
TCNATNAACTTNCCCNCGGAAGGNNTTATNCGNGCNGAGCNTGGNNCNATATTTTGT 

SEQIDN0472 

TGGGGGCAAGCACCNGCGGCGGAGNGGAGGAGNANGTGNNGGCTTNNCAGNNNANC 
SEQIDN0473 

TNCTTTCAAGAAATCNATGGTGATGAAAATCTTTTTGNNGNTNCGANATGAGGATTCATTTG 
GAGNTAGACAATTACCAATTTTNCTTTGCCTTCTGTAATAA 

SEQIDN0474 

TTATTACTGAGCTTCATTTCTCCTGCTTTCAATCATATGCATAGCATGTAACACTTAGTTTG 
TTTCTAGAAAGATTCTGATTAGTATATCTATCAACGAATAGGATGTAACTAAAATCTGGAAT 
ATGTTAGTTTA 

SEQIDN0475 

CATTATGCGGANTTACAGGATNANTACAACGACTNATCTGANAAGCATANNTTGATCTTGCA 
GGNNNTACANGATGTNAANNTGGNTGCAGCAAAAGCAGGAAGAAAAGGTCATGGTGCTNNTT 
TNGCCAANANTCTCNCTGCGGAGCTCTCANCTTTGAGAGTGGAAAGGGAGAGGGAGAGGGAA 
ATGCTGAAAAAGGAGAATAGAAGCCTTANAGCTCAAC 
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SEQIDN047 6 
TATCNCAANTACTGGAC 

SEQIDN04 77 

NTTAGGTTAAGTACTTTATTTTGAAC 
SEQIDN0478 

CCAAGGAAAAAGGAAATCTTGATAAAGGACTTTTGGAATGTTGTTTGC 



SEQIDNJ0479 

AGNANCCCTGTTGTTTCATCGGATTCGGCTACTGCCTCATCAGAGTTGCTCTCAGATTTCGA 
CCGGAGTTTTCGTTTTCTGATGGATTTCA 

SEQIDNO480 

TCGGNATATAATATCACCGCAAATGACCTCGACTCTCAAATGGCGACCTTGACCGCGAAACT 
ACAATGATTCAAACTCGAAAAATGCTCAATGATGTTCAACCTGCTTTA 

SEQIDN0481 

GCCCCTTNAACAGCAGCAAAAAGGACAGCAGTCAATTCCCCTTTTCCCTAAGACTGCCAATG 
CCTAGTCAATCCATCATCTATCTAATCGGAAGCAGAAAATACCAAGGCTTCCAGAACACCAG 
AGCATTGTCACTGCAACTTGGTGGGCATTTTCCATTTAGAACTGACATCTGTTGAGTGAAAA 
TTTTATAGCGCACTCTTTGCACATCTTACTGGTCCAATAATGTTCTTCCAATTTGATGCTGT 
TTCTATGCTAATCCAAGACCTGTTTCCCGTCTCCT 

SEQTDN04 82 

AGACGCTGTNAAGTAATGAATTTCTTGAGGACGCTCATCGAAAGGACC 
SEQIDN0483 

TGCCCTTTTNCCAGCCGTGTGTTGNTATTTTCGTCACAAAGNTTATCACAGGTCTCAAAGAT 
CACCAATNAAGAGC 

SEQIDN048 4 

TTCCCGCTNTANACGCCCTTATTCGAGTTTGAGGATCTGTCNAGGTCGAGTTTACGGCGAGT 
CAAGTTGTAATCTTGTTGTTTTGACAACGAGTCGATGTTTTTAGTCAAGTAACNC7^ATACCA 
AAGGAAATGGNC 

SEQIDN0485 

CTANCGGNAAATCTCCTTCTTCACAAACGAAACCCTAGCAAAACTCCATCTNCATATCAGGN 
CGTTTCAACACTAGAGACCAAAGGAATGTCTCTTCAGCCAAGAGTCATGCCCTCCCATCCGT 
TCTGCTTCTTCACCATCTTCA 

SEQIDN048 6 

CTCCTNCTTTTATTTTACCGNTAGCTGATATTGTTGCTTTGATTGGCTTTCTAAAAATTGTA 
AAATGCATATTTACGCTTGAATTTTCAGAGATGTATTTTGGGTGATTGCTTTGTTTATTTTG 
AG AAGT AGAG AT AT T GAAT T CC AC C 

SEQIDN0487 
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AG GAAAATN GT G AGAGC AAAAT AAAT G AG AG AAC G AGGAAGAAAC AG AT ATG GAT AT G AGAA 
AACGATNCGCTTTTCTTCTTTCCCATTCACCTGAAACCAAAAACACCTCTCTCATTTTAGCT 
ACTGAAACAATCACCAAATGTCACCTAAACAACCAGAAAACCTCCA 

SEQIDN0488 

ANNCCCNNTTTGAGGGANNNNGGCTGGGNCTGATGNGTGTGATGCTACGNACTTANGANNCN 
ATGCNGAAAAAANGTATATCTACGTNGGANGGCCNTTGNTNCCTGGNGGCGNAGATGNCGCN 
ATTTGTACTTAGACACATTTCAAAGCATGTTGGCNAANGGAGATTGNGAAANTNTTGNTGTN 
AAANTTAGTCNTNAGNGTTACC 



SEQIDN0489 

TNCCCGGTTNGTTAAGNGACTTCAGCTCTGGCATCTGTAAGTGGGATATCAAAGCGCATTTC 
TGAAACCCCTCAACGAGAAAATAGAAGGACTAGAACAAAGCTTGACCAACCATTCTGTAGAA 
G C G AC AGAG AAAAG G GT G AATT GCT AT C AC AC T CAGG AAAT T T C GAAT C AAT AAACGAG AAT 
GGAAACAGAACATGTTCCCGTACTGTATTTTCTCCTTTCAGC 

SEQIDNO490 

TCAANTGANAGGTGTGGGAAGAAATGAAGAATTGTTGATGGCTTATTTTGGGAAAAGCCTTA 
CAGGAGTAGCTTCCGAATGGTTTATGGATCAAGACACGTCTTGTCAAACAGTTCCAATACAA 
CATTGACATTGCCCCAGACCGCAATTCCCTTTCAAACTTGAAGAAGAAACCAACTGAAAGTT 
TCAGGGAATATGCCA 

SEQIDN04 91 

GGAANCGGGAATTCTTGATAAAGGGACTTTTGGGAATGGTTGGTTTGGC 
SEQIDN0492 

GCNTTNCGGAATTCCTCTCTCTATATGAGACTGAAAGACTATGTTCAGGAACTTGCTAAATT 
T GAGAT T G AT AC AC AC AACAT T AT AA 

SEQIDN04 93 

NNCTGGTAAGAAATAGATGGTGACAGAAAANNTTTNNGGNGTTACGNTNGANGATTCATTAN 
GGGNGANAAANACCAATTTTCCTTTGNCTTCTGTANTAA 

SEQIDN04 94 

GGGCCTTTAGGGAAGGATGCTTTGTTGGCTTATGGTTATGA 
SEQIDN04 95 

TTNNCTCCANTACGGAAACAAGCACCGGCTACCGAGGACTCCNATATGACACGAGAACTTTT 

CAGGTTTGGCGCCCGTT 

SEQIDN04 96 

GGTATGGAAGAGCTCANNCNAAACGNGAGGAANTTTNNGGAAAACAATATGGAGCNTCAACA 
TGGATAGGAAATGTCAAANGCTTGGGCGCT 

SEQIDN04 97 

GTCCGAACACCAAGAGAGAAACCCAGTGCCAATGGAGTTCAATTTTCATACTGAAAAGAGGA 
TTCATCATAATCCGCAATTGATCTGT 

SEQIDN04 98 
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GAAATACACNATTTCNAGCTGNNCCCTNGAATGGATGCCAANNNTGCTAATGCTNGNCCAAT 
GACNGTATCGANAANANGTCGCACACNAGAATTGAGGCTNACAGGGATATGATTACACCTGT 

TGGAGACGCTT 
SEQIDN0499 

GAACANTGATGNTTTCCTCNNGGNNGGCTAAGGNNTNCNCCNACCCNGACAGGGCNTGGATT 
NNGGTTCTTNTTTCNNCGNGTCCCNNNNAATCTGACTTTGACTACTAAGAATTNCATACGNG 

TGGGGT 



SEQIDNO500 

TTATGTTTCTTGAGTGTTTTCTGTCTGTGAAGGTTTAGCTCACACCAAGTTTTCTTTTCATT 
TGCTAACACCAATGTTCCCACTGAAATGTGGGACAAAAGTAGGAAGCAAAGGGTGAGAGCTG 

CT 

SEQIDNO501 

GCGCCTTTGTNTATAATGCACCTTTTTTCTTCTGAAAATATNCTCCTGATGATCTTGCTTTG 
GCNCTATGAATTCATTATTGTTTGTGNTGAATTGGCTAAACCTAGGGGTACCAACTTTTTAT 
TCCTGAAGTGGTGGAACATTTACCTATCTTGTTT 

SEQIDNO502 

ACTAANCNNNCCCCATAACTNCGNTTAATNTACATCAAACCTGTACTCTCTCCATGTAATGN 
GGTTGTNAGATCACTGTTCTCTATACGAGGCTCATTACATACCGAATATACGACCCTCTTGN 

TTCTCTTTTGGCTGT 
SEQIDNO503 

NACNGCGAGNGATACTCCNAAACNGNAAAAGAACTCCGGAACACGCNTGGAGCANGAGATTT 
TTTTGAGCACACAAGGCGGAGCCAAGCTCTAACAGNCNGCANGAAGGAAGNGATGCATGGTG 
AGAGTACAGGCGAGAACACATGACATCTNTAACATACTCTCACATAANCTNGAAACTGACGT 

GTNNNACAGAACTNAATGCT 
SEQIDNO504 

NATCCTCCCNTCNAAAAGCCCGGGTTGCCAGGGNTTGACGTCTGACCGATTTGCAGAAGTAT 
CATTGAATGTTGCTCGTCATATATCTGCAGACTTGGAGAGGNTTTACCGCAATGTGGGGGGT 
CAGCCGCAGGAACAAGCGCCTTGATTACAGTGATGCTGGTGGATTCTACTGCAGAGATCAAA 
GTCTTCTTTAGCTAGCAGTCCTTTTGATTATTCTTTTGTTATCTTTGAGTTTGTAAGAGTCT 

NCTGNTGTTTTGATCATGNTATTTTGCCTTTTATTT 
SEQIDNO505 

TTNTGGAGAAAGGNGTGTAATGNACATTGTGTGTANGCACAACATGGATTTTGT 
SEQIDNO506 

ACCTGGTTGTTCCGANCCACCAAGAGAGANNCCACAGTGCCNNNGGAGTCCANTTTTNATAC 
TGAAAAGAGGATNCATCATAATCCGCCAATTGATCTGT 

SEQIDNO507 r.~~„r- 
TGTGGCAAAACATGTAAGCGAGCAGCTAATCAACAAGCTTGATTCGGAGATAGAAGCCGCTG 

AAAAAGCTCATGAAGATGAACCATGACATAGCTCAAAGATTACTTAGATATAGTAGTTCAAC 
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CTTACTAATTTTTGTTGCATAGTGCAAATAGACTTCTTGAATGCTTTGTAGAGGTGAACCCA 
AAC T T GT CAT AT CAAT T C T AT AGT G 

SEQIDNO508 

AGGCNCTGCTNCTGGGTCCNACTNTGCTACACAAGNAANAAAANAGCAAGCTCTCGTTGGTT 
TNCTCT 

SEQIDNO509 

GGNTCGGAAATCNCGGATGNAAGNCCCCAAGNCGNANGATNNNANGCGCAGGGGTATAGNAT 
GANANNCCTATGCTATANGGAGCTACAGTAGGCNAGNTTATTGAGGCCTGACATTNCC 



SEQIDNO510 

GNCNCGGTTTNNGCTCCGCNATTGATCGTTACTGTGACTAGACAGAAACCTGNANGTCTTCA 
NACTTTNACTVAAAGGAANGNGCTGACAAGGCAACAGGCCTTCCATCCTATGATCACGNAGAA 
TCAACTNTTGGAGCATTTGACAACATTGCGCTATAGCC 

SEQIDN0511 

AANCCCTACTTTATACATGANGTNTGTGAATACTTGTAANGGAAGNATNNNGANNAGNTTGG 
GATGCNAANGTATGTTCTGGTGTTATGCATNCTNCNANTGCTCTTGCTGAAATCCACAACTA 
NAATANTACTTGCACTACATTANGGCTGTNNTTANNCAATNANTAGTTTTTTGCTGATTTGC 
ANCTCCATGTATNGATAGCNGAGNGTNGACAATCNANNATTCCT 

SEQIDN0512 

NANCCCNCTGTAAGCTCNCTNAGGACTAGTNTAAAGGGGGGCAAACANCTGATGAATGCCAA 
CTGAGAT 

SEQIDN0513 

NNCTTTTTTNGTGNNNCATATTNATGTTTNTATNACAAAAGANNTGTNTAA 
SEQIDNQ514 

GCCCCGATNTTTTAGGGNNAAACTCTGCATTTNTGAANGGAATGANGTCTATACGCATTGA 
SEQIDN0515 

ATNCNACNNTTGCNATGCNTNGTNCNGGGACTTGAAGCCNNGCAATCNNCTGNGGAATGCCA 
GCTNNGAT 

SEQIDN0516 

CCNGGANGNAGACCCNCTGNTGGCATCAGGNTATACTAGCNTCAACTAGGGAGTGGAGACCC 
TATNTTGACA 

SEQIDN0517 

TCCTNATNTTAGCGGCCNGNNTGCNGTTCTGGTCANTGATGCNACTNTCGGNCNAATATNNT 
GATGNGTGCGACANNGGGA 

SEQIDN0518 

ATGTNCCGANNTTGTTATCCTNGCATGATNTANGGGAATGATNCTCTNNTGTAAATCAAGGT 
GCCGTAGGTAGTTNAGGGACANTNTATATAACATGCNGATATGNGTGTGAT 

SEQIDN0519 
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GCCGCTNGTATTNATCTGTTGAAGAAATTGCTGNTCAGTTTGTTCTGCAGCAGTATGACAAT 
CCACTTTCTAAGAAGCTCAACGATATCA 

SEQIDNO520 

CCTAACTNTAAGGCCGGCAAGTTCAAGACCAGTTTAGCAGACACTTCCAGAAAATCGCTTGA 
TGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCCATACA 

SEQIDN0521 

CCCTCTNTNATGNCCCNNAGCTGCTGTGTTAAAAATAGAGNCCAAGAGCTCATAAGAATNAT 
GTCCGAGGAAGGATTATACTGTGNCAAACAAATCNATANNTTCATNGTATATNGNGNGGGGN 
ANCAGTGCANCAAGTGTGGGGANTGGTTGCTGGAAAATATAGGATCAGA 

SEQIDNOS22 

NNTAACAACCCATGNTNTANGCACAACAAGTGGAGCATATNCTAAAAGTTCCGGNGAAGAAC 
T T G AG AAG G AAAG AG AAAG AAT G G T AC C G AAAT G GAG AG C GAG N G GAT T T 

SEQIDN0523 

ATGCNNCTTGNNGTAACCTGCCCGACATTTATGCCNTCTNGNTTATGNTTGATGTTGCGTAT 
TCAAGTTATTGACATTTGGCTGT^ACAATTAGTTCAAGTTATTAGTTAGTATCTAGTATG - 

SEQIDN0524 

TGTGCACATGNCTGATNGTGCTTGNTGGNTGTGGNTAAGGATATCGNNGAGCTAGNAGNACC 
NTACTTNGANCCGCTGNCATGATGGTTCGNTNGTNCNNGCTGCTGAGGNAAGACACTGTGTC 
NGCGGGACNCAACTCTCCAGCGCTTTATNAATG 

SEQIDNOS25; 

TAAGGGCTGCTGAACACATCACCAATGACTCACAATCTTTGTGGCGGACTAAATTGTTTTTG 
TTTTTCACTGAAAAAGCCTTTGCTCACCG 

SEQIDN0526 

AANTCCCCCTGTAAAACGCCGCGCCAAAACTGGGGA1SIAAAGAGCGGNCCAGCNNCCGATCCA 
NCGNTGAANNNACNGGNNGNGNCANNANNACNNGAGGGNANTTTNNAGG 

SEQIDNOS27 

TCTCCAGAATCCTCATCAATGCTCAGTATGTATTAGTTCTTAGTGCCATTTTTTGAGAATGG 
CCAGNTTCAATGTAGGGTATAATTTATTGGCTCTTTTGGTTTGGCATTTGTGG 

SEQIDN0528 

AACGGGACCTTCGATCCAGACCTCAGAAACTCGCCGGAACCGTGACAAAATCCAACAACAAC 
NAACGGCTGAAGCTCTCCTTTCAGAAGTGTCGCTGCTGGTTGTTTTCAGTGAAGCAGGGGTC 
ATTGGTTTGG 

SEQIDNOS29 

NTGAGCNCAATTNCTGCCAAGGNCNGNACGGNCGATGNTGAACTGAGNCCNAGAGGNAGCNN ■ 

GCACTTACCCTTATNTNGGGGANGNNGAGGTATACAAGGTATTTTAGTATGGTATTCTTTGG 

AATCATTTCCGCTCNGNCCTAGTTTGTTGNTTCCTG 

SEQIDNO530 

CGTTGGAAANCCGTGANGNNTNGGGANA1SINNNNNCCANAANAAGTCGCCTAGAGGNGACCGA 
NCGNGTAANCAACCTTT 
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SEQIDN0531 

ACGCNNCTNGTNNATNAGCCACTGAACCNAAANNNTNANCTCCGCACGATGCTGACGGCGAC 
GGNTACG 

SEQIDN0532 

TCTTTNGAAAGNCCCTTGCATTTTNGNANAGGNNNCTTTNGCTTAGNCTTAGCAAGCTGNTG 
GGGAGAGTGGTCAANTNTTTNGNCAACANCTNAGCATNCACATGC 

SEQIDN0533 

ANTCCCCTGTNTTCTTGNTCACCNGTGTGGAGGNTGNACTGCTNCNTGGACAGGNCACAGTG 
GNGGACTGACNGTTGNNACAGCCNTATTGNGAGCG 

SEQIDN0534 

TAGCAAGGAAAGGGCTCTAATTCTTGCTCGACTCCTTGGGCGGCNTA 
SEQIDN0535 

AAATCNCCGATNNCNAATACCNAAGGAACATCAACAAANGACNTCTTACTATGAATCTTTTG 
TTTGATGTTTAGAGCTTATTTATTCTTATGATGTTGATGATGATNCTTTAGGCATCAAACTT 
CATACTTATATCTTTGTTATTGTATCTGGATGTTCAACTTCTAAGTGTTATGTTGTTTTTTA 
GTCTTTGAG 

SEQIDN0536 

NANCCCCCNTCNAACAAACCCNTGCTGTACCCATTTNACCGNTTGCAAAAGACATGAGCCTG 
NNGGAAAAAATTTACGATTCTATCCTTGTGATGGTGAAAGTNTTNATTTATGATAAATCTAC 
CACTTTTGATTGGATTTCACGATCCAAJ\ATAAAGGATGGTGTTGCATACTATAAGATTTTAG 
TTTGGAGATCGGTTTCCCTCTTGATC 

SEQIDN0537 

NACCCCAATNNAACAAGCCCGGGTACCGAGNNTCCNATATGATCGAGAACTTTTCAGGTTTG 
GTGCCCGANNTTAGGTTNCTCTTCTGTCTCGGCAATGGCTTTAATGGCCTTCAGTGCCAGAT 
CAAATTCCTCATCTTCACATATCATTCCGATTACTGGCCCATTGATGTGAGTAGGCAGAGAA 
TTGTTCATCATATTANGGGCCTCTTCATCCCTAAACATTATTTCCTTGACATTGATAAGGTC 
TTCCACTGCTCTC 

SEQIDN0538 

TTATNTATATTGTTAGACNTTGGAGTCTGAAATTAGNGNTGTTTGGGNTGTACGC 
SEQIDN0539 

GNAGAAATCNAATCNAAGTAGAGGAAGGGCGATACTGGGAAGGGGGGCCTTAGCN 
SEQIDNO540 

GGGNATGTCAAGTANGACANTATGGNCGANNCTNGAGCGTGCACNATGTCTATTNCAGCANC 
ACATTGANGATANCTGAGGANTGTCGCCAC 

SEQIDN0541 

AAGTTNNGCGANTATCCTTCGCTGAGTNTAAATCTATACANTCTTGAATCCTNATTACACTG 
TTAGAGAGATNATGAAAAAAGGACCTNTGAATCNAANNNCCTACTATTTTGCTTCGCCTTTA 
CC 

SEQIDN0542 



FIGURE 4 (continued) 



BNSDOCID: <WO 030851 15A2_I_> 



WO 03/085115 



116/140 



PCT/EP03/03703 



CNNTNNACATTTAACAAGTGAGAGTTTGAAGCCCTTTCAACTTGCGCATGTGAAAGCATTGA 
ATCTTGCAAAAGGGGAATGCAGGATGATAAAGAAACTCTGNCATCTGTATGGAATAAAGCTA 
TTGAAATGTGCGAATCCAATTCACTTGCAANCTTTTTGAGAAGACAAGGGAAGTTGTCATCA 

ATTCGTC 
SEQIDN0543 

CNCTTTGAAGNNCCACCATCGTACANGGGANAANACGGCNACCANAATCCGGNCAAATTCNG 
GNGNNCTNCCNGAACNCNTNTTTTTNTTTGGGTGCCACCATCGNACCGGNCAC 

SEQIDN054 4 ; BSTC4-34-18 5 

CTGNCAAACCCNGGGNAGTCAGGNAAACGTCCANCATGGATCTGGATCNNGGCACAGNGAAG 
GCAACGCNANCGACNTAGNNACNNNANGACTGTATNAAACANAGNCNGGANTNATACTGANN 

NCANNNANNAGNNTANGAAGNTTCANGGCNC 
SEQIDN054 5 

TTGNNGTNGNAGGNGGAACGNAGGGCAGTTTNNTTCCNAGGGANCACCANCNANNNNGNTNN 
TNNNNNAANNTTTTTTGNTATANNCACACGGANNTNNNNACNANCGAGGGGGGNTTTTTTCT 
ACANTNNATTNCGTGGGNNANAATCAAACGATGANNNCNGNGNNTNCNGNGGANATGNNCGA 
CNNGNNTANNGNTCGACCNCNACCACNNNACNGGAGNNGNNGANNGTCGNNNCTCATTAANG 
AGAGNTTAANCNGAGTGNAGTNAATNACGNCNANANNGANATNTANNTTTTNNNCNNGGNCN 
NNTANNTANNNTNACNTANNACNNNNGTATNNTNCGGNNGCNTTCCCANNNNNNTNTANNNC 
TNNNTCGAATAAGANNCNCGGNCANGNNCNANTCCCNGCTNNNCAAAACACGNNAGNGGAGG 
GTCCGCGNAGGCAGTGAATCCCGTGATTNANCTACAAGTGCCTTGNGTGCAGNTGNCAANAA 
CAGGAAATACTTNTGGAATAAGTGATGCATNCAGAAATGCTACTTCTGGCTCCAAAGTTGCT 
GACTGCA 

SEQIDN054 6 

GAGGTACATAGCAGCTACCAGGCAGTGATTCAAAGTAGAGCTGCATTCCGTTCGTAGGCCTT 
AGAATAACTTCCTAGTTCCTATATACTGTTTCCATTTTATTTCAGACAGTATTGTAATTCCT 
TTCCAAATATTGTATTTAGTATAATCCCGAAGCTCATGTACTTGTGACTTCACATATTGGGA 
TATTCGCGTTAGATGTTGGTTTTAGACTTATTGTGTTTGTATCAGAATTGCCTTTACGTTTT 

GTTA 

SEQIDN0547 

CTCTTTGGAAAGCCCTCATNGNGTGAGAANACNANGCGGNAAANNCTNTTGNNACGCCNATT 
ACTCAGGACNCATCATTTTTTTCNNNNNCACGCTANAAGGGGGACTATNNGGCCTAAGGANA 
TNCAGGNGGNNANGCGTANTACGGGAGAAAGGGC 

SEQIDN054 8 

CCTTTNGAGGCGGCATGGATGTAGCAGGGAAAGGCTCTAATTCTTGCTCGACTCCTTGGGCG 
GCNTA 

SEQIDN054 9 

CCNTTGNTGAGCCTATCTNNGTTCCGAAANTGA7\ACCGACGCTAACTTTCTCCACTAGTCNG 
CCTTTCAGTA 

SEQIDNO5 50 

CTTTGGNAAGACCGCGAAGTTGAAGGACAGGGAGAGATGANGNGCGNCTCCTTAGGGNACGA 
TCCCTANGNCNNACCGCNNTCACACAGNGTNTGGGGTA 
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SEQIDN0551 

GTTNATNATGCGATTCTTTTTCTGCCTANGGTGGNAGNGACCAAGGAATTGCAGGACCAATT 
TTTTTTGGGTTATNTATCCCTGCTCTAAGGGCACTTCATTGGTATAGGTTGNAAGTGTAAGG 

NNTATTGTTGGCTGGCTA 



SEQIDN0552 

GAANNCCCNGACNNATTTGGGAAAACCACCTGANGAAGAAANGATATGTNGCATNTAAAGNT 
GACTTATGAGTANNAGGCTANGATNTGTTACCANACCCGNGNTGGTAATCNNAGNACTATAT 
NGAACATNTTANTTGNACCTTCTNANTACATNANCNGNTATGAGNACCANTATTACNCNGNA 
CTTNATTNACANNTGCGNNGNNAGGANATTANNGGTGNCNCTNGATCGANTTCTGACTCATA 
NTNTNACNNANCNAATGNACNNNTCNAANGTNNTNANATNATNNNTCNCGTGAATCGAGNTT 
TAGCTATNGCNGCNNACCACGTGAAGAAGAAATGATTTGTTGCACGTAAAGCTGACTTATGA 
GACNGAGGTTATGATATGTTACCATACCCGAGTTTGTAATCTTCGCACTATATTGAACATCT 
AGTTGTAGCAGTTTTTTTTATCATCTGCTATTTGTGCATTA 

SEQIDN0553 

TNAAAAANAATGGATAGCACTAACACAAAGGCGGCAAGTTCAAGACCAGCTTTGCAGACACT 
TCCAGAAAATCGCTTGATGGGTGAAACTGAGAAGTGAGGCTTACAAGGCAAACCATTTTGCC 

ATACA 
SEQIDN0554 

TGCCCNGTGCNTGGTTGTGGCNAGNGNGCTAGANGANTCCNGANGAGGNGNAGACCGNGAAA 
CCCACCGA 

SEQIDN0555 

CTTTGGAAGGGCCNNAAGCTNNNGGGANTCNGCNATAGGGGAATNAACCNATGTGCATGCAA 
CAAACAAGCCGNTNNATGTCANGA 

SEQIDN0556 

CTTCGTNNAGANCAGGGATTGTTGNTTTCCAGCGNACGATTCGAGGTTCGGATTNGGNATTT 
CGATGTCTCANTCCANGGGATTGTTGCTTTGTTTAGCCCGA 

SEQIDN0557 

CNTGNCNTNCCGCNGCTCCTNCNGTGANNNCNGCTGCTNTACGGAGCTGATNCTGTNNNTGT 
CAAGGAGGNCGACACAGGTANGNNCCNNCGNGAAAGTGTGTANATGACAATATCAAGATTGT 

NNGGAGA 
SEQIDN0558 

CTTTGGANANTCCGAAGAGATNAGGNAGACGACCCTGATCCTGNAGGCTGAGCAAGAANNNA 
GNNCACAATGAGCNATNGCTANGNNAGCNGACANGCAAACTANCTCNNAANCTNTNCTGGTG 
ATNCCGNTGATCANGGAGGNAGCCTTCNACCAGACAGNCNTGACAGGA 
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SEQIDN0559 

CNCTTNGNCACAGCCCTATTTGTATTTATGTTTGAATTTTATGACAAAATGGTCGTATTTTT 
CTCA 

SEQIDNO560 

ACGACGCGTANAANATCTGAAGGATACCTATGNNCAANCGAACCAATGCACGGATATCCNTT 
TATAACCCAAATCTTCAGTNGNGAATATCTCTNCAGTTCCTTTTCTATTGC 

SEQIDN0561 

CGAGTTTNATGGCGNNGCGATGTGGACATTCGTTGTGGNGGCCTAATGCTGAAAAGGGNTAT 
TGATATGGCAAGAGGAACCCTCTGCAATGCAGAAATTGANNGTGGCTCC 



SEQIDN0562 

ACNTGACTTCNTNCAACCAGCCATCTATANNANAGGAAAAATANTNTGAGGATTCCCA 
SEQIDN0563 

CTGTCCNTTTTNTGNGACCTNGTGCNGGCNTNCTCTGANNGNGCCCNGTNAGCGNCCAACTC 
NNATCAAGCTCCTTNCAANTGANTGAGGACATGATGNGGTNATTTACTCGTGANGAAAGGCA 
GCTNATTCCTGACCCNATGGAAGCAGNNAGGAAATCNGCTCCTNGCTNCNNACTGNGCANGG 
NTNNANNGTACTCGNCCATACNGANGTCNCACANNATTGCTANATTGTTNCTAGCA 

SEQIDN0564 

NCTGGCAGTACCAAAGGTCCTATGGATTGTTACTTCNCGCAAAAATCTGGAGATAAGGAAGG 
AAAAAGTGGTAATCCTCAAATTGATGCCAAANCGATTTTGAGGGATCGTGCAATTACAATGT 
TTGCGCGGTGGATGTATGATGCAGGTCTTCCTT 

SEQIDNG565 

GNNGCTAGCAGCTCGGAGTNTNTTGNGGTCCNGCNGAAATTTTTNNTGGNGNACACNGGAAN 
TTGNNNNATNTCTNATGGNGTATGGTAAGAACTNATTTTTTGANATTGANGGNCGANATGTT 
CTTNGGGGGGGNNCCGTCACACCTGTCACTTCATTTCATTTTT 

SEQIDN0566 

AAAGGAAACTAGTTGGAACTTGTT 
SEQIDN0567 

AAGATGATGAGCAGATTGCAAGGAGGAAGCACCT 
SEQIDN0568 

ACTACGACTGGCAAAGATCAAGTTGTAGTAACTAATAAATACTCGAGAGAGAACAGTGGAAA 
TCTTTTTGT 

SEQIDN0569 

TNCANGNCGCTGCNCANGTTCCTNGGNAAACAGGCCGNCTTGGGTTGTACTCAGGTACTCAT 
GAAACTTGTATAGNGCTNGTAAGAAGTTTGNGTNGTTCGGT 

SEQIDNO570 

CTGGCNGNACCAAAGGNCCTGNAGCCCGGTANATTCNCCCCTGTAGCTNCANACTTCCTGAN 
TNTACTNTNGATNNNACATTATGGGGNNAGACCACNATNTNNATNNTCNTCAGCTNGTGACT 
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TCATGAGNTNTCTTGGCCATGNNAAGCTAAGACATCAA.TATGTGAGNGCGNTCACGAGCATA 
TGCCNGAGCAGACATTCATAGAGACTCTNTTATTAGTGG 

SEQIDN0571 

TTTCTGCTGNNTCAGTGAGGTTAGATCGTAATGGAGCACTTTTTATGGAGAACATCAAACAA 
GAAGTTGAAAGTATTGATGCTGATGTAACACCTTCTCGAATACAAACTGCCT 

SEQIDN0572 

TTTGCATGNCTTCGAAGGNCAGTGCTTGNTCTGAACCCGTNNCTTGGACTTGACT^ACTAGCA 
TCTTCTCTTTGCATGCTGCCCTCATGTATTGCCAATGTAATTTCTCCTCTAGCAAACCATTA 
TGTATTACAAACTATTATTATGATTGTGAATAACTTGTGAAAAGTTCAATCAATCTGAAAGA 
AAGTAATCTCTCT 



SEQIDN0573 

GGNTTCTAATTTCTAAGTTGATGGCTCAACCAAAGATATTTAGTACTGAACTGATTGTACTA 
ATTGTTCTATAAAATTACGGGGTTTAGAT 

SEQIDN0574 

ATATGATGNAGTCCGGAAGATCGAATNTGGGGAAGGTCCTTCTGGGATCAAATAGGCTAGAT 
TTACTTGTTTTTCCTAAAAATGTAATAAGGCCAAGTGCCAGTAGTGACTTATTTTATTATTT 
TAGTGTCGTTTTGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCAT 

SEQIDN0575 

TTTAGCTACGANGGTTCTCTNCGAGATTATATTCTCAACGCTNATGCNCACGCNTTTGCTTC 
TCGTGT 

SEQIDN0576 

CAGNGCTNGAGCTGAACCCGGNGCTNGGACTTGACAACTAGCATCNTCTCTTTGCATGCTGC 
CCTCATNTATTGCCAATGTAATTTCTCCNNTAGCANANCATNATGNNNTACAAACTATTATT 
ATGATNGTGAANAACTCAGTGAAACGTTCAANCAATCTGAAAGAAAGTAATCTCTCTTTCCT 

SEQIDN0577 

GGGAGGNAAANTNGCCCTGAAACNTAAGAGGCTGAGACTTGTCATAAAGAAACAAACTNTAT 
TCANGCANGAGAAGAAAGCAGTAAGGAAAATCAGCAAAATAGCAATGAGGTTAGCAAAGTTA 
TTGATAACAACGGAGGGACCAAGGATGTACAACACAAGAAGGAGAACATGAACAAAAGAGCT 
AT GAC C ACT GGAAAAAT T G AGC AG AT CAT G 

SEQIDN0578 

ATACGAAGGTTCAGTGCTAGTAGCTGAACCCCGTTGCTTGGGAATTGATAGTTTGGGTGACA 
G 

SEQIDN0579 

AGAAATCTGCCNGGTTTGCATGGATATAAGCAAATGCTCAAGAATGGTGCTTGGGAACAGTG 
CATGTCTGCCTTGGAGCCCTCTGTGAAGGGCAAGCTG 

SEQIDNO580 

GCTCAAGGGGAGGTGGCNCCAGAGNNNAGTGCGNGGTTGGGGNAAAGGGTGCAGATTCTNCN 
AAGGNCCGTCAAAAGAGCCCATGNCCAACAATTTACTAATGATTCACAAGAGCNNTGGGGGN 
GG 
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SEQIDN0581 

NATGTGTATTCCTGAANNANCTNANTGN1SICAATTATTCAACCANTNATTNTACC7^GTTCA1S1 
TGTTANCCAGANTANNCNTCATTNATCTNTNTACATGCNTCACTAAGATNTTATTTGTAACA 
AGNGGTTTTGTTGGNTGG 

SEQIDN0582 

GCTCTAAAACCAACCTTTATCAGTCAGAAATCAGCTTTCAAACTCCATAAACACAGCAGTTT 
GGTTTTCTTCACCATCGATTCTATTTTCCGGTCGCGGTTCGTCACATTTTTTGAGTTCAAAG 
CTATCAAACAATTGAATTTTAGACTTATTTGAGGTTTATTTCTCCCTTTCCGCTATTATTTT 

TGG 



SEQIDN0583 

TCTATCACAATAGAGTCCTTTGCTCGGNGAAGAGATGGGGCACATCAAGCCACATTATCCAC 
TCATCACAATAGTAGAGGCCACACACAGAGAGGAAATGCGCCAAAAGGAGCTGCTGTATATA 
CACATCAAAGTTATAGGAAGCATGCATCAGGAAGAGGAAATGGGCTTGTTGGAGCTGCTCTA 
CATTCTCGTCAGAATAATATGGGCATGGGCAGAGGACAAGTGCCAAATGGTGTTCCTCAACT 
CAATCATCGC7VATGTGGGAGGTCAATTTCGCGGACGCGAAGCAAAGAATTCCCATTTGG - 

SEQIDN0584 

NGGAAGGNGTTTGGTGTANGGGTGGGGGGATTGGGGTGGACCCCAGGTGGGGGGG 
SEQIDN0585 

AAANGAGGAAAANTAATNTATGGCTANNNACANATGACAAGGACATAAGGTAACTNNGCATT 
CTANCC 

SEQIDN0586 

TTTGGCTGCCNTTTGCTAATCCNTTGCAGTNTNTGTGCATAATNNGAGTAGGGGTATGAAGA 
TGCCCACCTNTTGTTCATTCACTTNAAGGATAATTACAAGCCAACTATGGAATGTGACG 

SEQIDN0587 

AGANTAGAATGTTGTAAGAGTATTGAACTCAAAGCAGTATTGTAAGTTTGTAAGTAAGTTGA 
AAGTATTGAACTAAAGGCTCGAGGTATTCAACTCGAAGTAGTGGTGTAAAAGTATTGAATTG 
GAAGTGCG 

SEQIDN0588 

CCCCTCATTTCACACATTCTTGAAACCAGGTGCACTTGCCCAAATCAGGTACTCCAAAATCT 
CTGCTAAATCAAGGTTGAAAACTGTTCAGTCCCTGTTTGCTATTTATAGTCAACAAATTCCC 

SEQIDN0589 

TTTGGGAANCTTTTGGAAGGTCCCATATACGTNTTTNNCAANTNANCCNGGGGCCTTCTNGG 
TTTTTTTTATGNTTTNATACGTCGGNTTGAGAANATTGNNTGNTTACAAGNAGGTGAGGAAT 
ANATAATATGATTCCTTATCTTCTTTGCG 

SEQIDNO590 

CAGCGGAATGCCACCGAGGCGATACCAGCGATGCTGCACGTGATGGCATGNTCTGCTTCGGC 
GCGACCTGGGGCAGAATAGAGGAATCCGGTATAGCGCTTCTCGCCCAACCGGTACTGTGCGA 
GGAAGCTAATCTTATTCATGCCG 
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SEQIDN0591 

GGCCNGTAGTTGGGCTNGNNACGCNCCNNAGNACCNACTGGCCCNGNNAANGAGNATNAGNT 
NNTCATGCNTTATACNGGNACTNACAACCCACCANCCATGCCATAGCAAAGAAGCGAGNTAT 
AAACACAAGNTCNGGACCTNTGCCTATNCCAATCAAAATTTACAAAGCCACGGNTACAAACT 
NCTAAACG 

SEQIDN0592 

TTTNNNCCTTTCNTNNCATGNTATACGTiAGGTCAGTGCTAGAGCTGT^ACCCGTTGCTTGGAA 
TTGATAGTTTGGGTGACAG 

SEQIDN0593 

CCAGACTCGCGATANCTGNNTNANCTAACANTAGCATTNTGANGANGTACCTGNGACTTNCA 
CATAGCAGCGGTGGGTCGAACAG 

SEQIDN0594 

ATCTATTGGATTTATGCTTTGGNTTCTGCTTCTAAAATATAGAAATTCTGGAGAATTGAAGC 
TCGTTTCTATTCGAGGTTGCAATTCCAGNTCGAAATCATGGNCCATAGCTCGCTCGAGGATT 
GCTTTTTCTTTGGAGATTATTTTGCACTGNACCCGTTGAAAAATTTTCAGNAACAAAGGTCC 
ATCTTCCCCATTGCAACTTCA 

SEQIDN0595 

AGANCCTATGATT^ACANGATNGGAGGACTCATGGCTNAGGCTTGGCTGGAAACATCGGNGCT 
GGGGCCCACCACTCTGAACCATATCGTNTAGGACGGTCTCCTACTAGCTGGCCTCCACCTCT 
AGCGGTCTGGCCTCCACATATAACGACCTGACCTATACCTNTAGCGGGATGACCCCTACCTC 
GNNCTCCCAGACCCCTACCTNTAAATGGCTTGGCAGGCAATGCCAGAATCATGGCACGGTGA 
ACTNTGNTACTGCGACTGAACACCCA 

SEQIDN0596 

TCNCAGCATCGCAAGTGATTTACTTTGNCTGGNGCCNCCAAGNTGGAAGGANGTTAGCCCTG 
TAATCAAGGCGNTNNTGNCCTTGCCTC 

SEQIDN0597 

CNCANCAACCNTTGTANATATGCNCTTTTTACGCTCGAAATTTTTTAGCTGATTGAAGAGGG 
TNNTCTCCNTCTTGGCAGGTATAAGGGGAAAGAAGCTGCTTATTGTAGCAGCAAGTTAGNGA 
TC 

SEQIDN0598 

GAGACCGTTGGCCGCATAAACAGCTCCANCTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTT 
CTTCTGAAATATATATAGACAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATG 
TGTTCCTAGCTATTAGTTTCC 

SEQIDN0599 

CTTGCCGGTCCATTTGCAGGTTGAAGTGGCAGCTTCTGGATCATGAAACGATTGAGTGCAGC 
GTCTGCCAGCATCCATTCCTTG 

SEQIDNO600 

CNCCACGTCTNTGTGCCG1SIAGCCNCCCCCTCGCNCCAATNCGGGTGTCATTNCANCG1SICANC 
GATTTTTACCTACAAGATAGGTGGNTCGGATCGANNCGCNACATTNGATCAGATTTGNCGGT 
GC 
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SEQIDNO601 

TNCNGNCCCNNTTTTGCNCAAANCCTTGAANCTCCAACCACTACCACCCCAAAATACCNACA 
TNNNTNGATTNAGCTCTTCAAGACCTAGCTATTGNTGNCAATTCTACCCCAAAAATCCGGCG 
ACCAAAATCTGGC 

SEQIDNO602 

TCNNNAACACACCCTAACCTTCAACNCCC 
SEQIDNO603 

NCAACAGCTCTAACTGAAAAGGGGAGTAATTGTTTTTTTTCTTCTTCTTCTGAAATATATAT 
AGANAAAAGAAAGAAAAATAGGAATGAGAAAAGGGGGAAAAGCATGTGTTCCTAGCTATTAG 
TTTCC 

SEQIDNO604 

NAGCCCGGAGCTTTTNAATTCTTTCATAACCCAAGGAGAAGAATAGGACTCTTTACCAGTAT 
CATAACCTCTCNATGGGAAATGGAACTTAGATCACGATGTGAACCTACTTATGAGTGGAATT 
TCGTTGACAAGCAAATTCCCCGGGAAAACAAACTTTTCTCCAATTGAGATGCTCTCTTCATT 
TATGGATTCTATGCGAGATTCGGTTAGTGCGAAGTGTGATCCTGGCTNAGAAGGAAAAGCTA 
TATGC 

SEQIDNO605 

GGNCCAAAATCGGNANCATCTCC 
SEQIDNO606 

TNCCCAGCATTCCGCGCTACCANAGAAAAAGATGGATCCACCANAGATNAAACAAGTTATAT 
TGGGTATAGGCATATGACGAANACCAGAGAGACAAGGGCAGTTCTATGAT 

SEQIDNO607 

GACCCTTCCCNCACCGTNTCGNATCTTCGNTTGAAGANTCGAGCNGGACCCCAACCTATGTC 
ANNCCCCCCCAAATCCATACCAGGNATCCANCTGNCCTCCCTTGNGACCAAACCAAGCTTGG 
CTTTGNCCGAATNTAACCAGAAAANCCCANGNCCNAANTCAGGTCCAAGAACCCTAGAAATC 
CGGAATCTGAGGGTTTTGTNNGA 

SEQIDNO608 

TGCTGTTTTCAGGTCTGCTGATTNTGTGACGACGTTAGAAATCTAGTCTCAATCCCACTGTA 
TGTAGTGTAGAGTA7\ACAGTTTTGTTGGGCAGCTCAAGAGCTGCTGCAGGTATTTGATGTTA 
GTTCCACGGGCTCTCCAAAATCTTGAAGGCCAGATTTGAAGAAATATCCTAAAAATATGTCT 
TCTTATTCG 

SEQIDNO609 

GNAGNGGCGNNCAGCATCNNTNGATCTGAAAGGGAACATGATTGTNTGGT1SIAAACTCGTAAC 
GGTAATTAATNACCTTGNTANGTCC 

SEQIDNO610 

GGGGTCGGGTTTCCGGCGAGTCAAGGNGTAATCTTGTTGTTCTGACAACGAGTCGATGTNGA 
AAGTCACGTAACTCAATACCAAAGGAAAGGGCN 

SEQIDN0611 
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CCCCGGACGTTTGAATCTGGGCCAGGTCCTTCTGGTACCAAATANGGCTAGATTTATTTACT 
TTTTCCTAAAATGTAATAAGGCCAAATGCCAGNACTGACTTATTTTTTATCATCTTAGTGTC 
GTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDN0612 

CCCCNCAGCTTNGAACATAACCCCCCGAGCATGACTGCTTNTGATTTACTTANCTTATGCAG 
TTTTNNANACGTTCCCACAAGAACACGTTCNTCGTTGNCAAACAGAGATTNGAAGGTTTGTC 
ATGATTCTGTTACTGNAGATGAGAATGCTCATGAGGGCGGGCTCCCTAAGGAAACTGAAGTG 
CATTCCCAAGACATCTCTGTGGATGCGAAAAGCCTCAATTCTGAGAAATTGAAAGCGCCATC 
CATGGAGGAAGAATCATGTCTTACTTATGCCA 



SEQIDN0613 

NGCCTACNGGCACNTCGGCTTNNTACTTNTGTGGATGGCTCCNNGCTAGCCAGTNTNAGANA 
NTAACNGNTGCATCCGNGACNTATNNATGAATTNCCATTGTTGTCNGATGGTNGGTCAGGGC 
ATAACCTGTTANGNTGGANANCATGATGTGCTGTGGATACACAAAGAATGNAGGCAGACATT 
CACAGAGTGCTTTCTCCAATAGCACAAGAAAAGGAACCATCGGTTTNTACACCCAGAGNGGN 
AACCCCNATTGTTTCCAANCNAAGCAGTAAATTCATGGGAAGNCCTTCTTCACAAGCAGGNT 
CATGGAGGCCCAAGCATCCAACAGTTGTTGCAATAAAGAAGCAAATTGTGTGGAGTCCTCTG 
AAGATGAAGGCCATGAGAAGTAGGCAATAGGAAGCCCCTCTCTTC 

SEQIDN0614 

ACAACGGCTAGGTTCCGCGAGTCANCCTGGNAAAGGAGCCTGGNNANNGTANAGANGACCGA 
CAGTNNCGNATACAGNCNCGAGAACGTNA 

SEQIDN0615 

CCAATAGGCTCAAAACGCAACAAAAACCAAAAGAAGAACGAAATTCCCTTGNTTGGATTCAT 
AATCTCAATTGTCTTGTTTTGTCTGGTACGTGAAAATGTTGATA 

SEQIDN0616 

GGAAGTAGCTGCCTNCTTGTGNTGAAGGCTTGCNGCTGTCTNCCTTCATTTGTTAGCCTAGT 
AAANNTGGCNTATATNTNCGATGGCCGCTCTCATGTGNTAAGCACNTTTGCTNAACCATTTC 
TATGATAGCATGAGAATGATGATGCTATGAGTTACAATGCTGGGA 

SEQIDN0617 

NGGAGGGGTCCGGNAGATGAATCTGGGAAAGGTCCTTCTGGTACCAAATAGGCTAGATTTAT 
TTACTTTTTCCTAAAATGTAATAAGGCCAAATGCCAGTACTGACTTATTTTTTATCATCTTA 
GTGTCGTTTGGGGATTCGTCTATTTTTATATTATGAAATGAAGCATTTATTGGCA 

SEQIDN0618 

CAAGGTTTTGGTCTTTCTTTTTTGGAGATTGGTTGTGCTATCTTAGCTCCA 
SEQIDN0619 

TACCCACNCCACCTCCCGCTGCTGNTCCTTTGNCTTCANCTCATTCNAAGCNTGACNNCACT 
NCCAATGTGTAAAGCTNAGNGGCGTACTCGCT 

SEQIDNO620 
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TNGTTGCTTCTTCTCCACGCCTTCTCCGGCACTACTTCTTCTTNTCCGGTCGAAAATTCGGC 
AGATCCCTCTCATTTTCTGGCTGGGCCGTTCATCTTNCTCAGCACACCACAAACAATCGATC 
" TTCTCGACCTCTCAACCATAAAGCCACCATCGAATCCCTCTCATCCGTTTACTCGAACATAC 
TAGTTACAGAACTAAATAAACTTTCAAANTTTTTGCTGTT 

SEQIDN0621 

GCACTACAAGAAGCTGCTGNGGCTTCTTGNAGGGTTTTGTGNGANNATACCACTCTTGATNN 
NTGTTNNCNCCGATGGTTATNGGTTTCANNGGAAGCNTCTTCAAGTCTTACAAATCTTATGA 
GGNAGC7VANAGNAGTATTCAATGACTTCCAAAAAGAA^ATATTTACAGTGAAGAACAATCTT 
CAAGTTTGTGTATTGATGAAAGTGATATTGGAGCAAGTGTTATGTCATCTGTATTGTTAGCT 
GGAATGTTTGTAGGGATGAAGATTTCAAAAAGTTCTTCAGTTTGATTGTTGATAAGAGTATT 
TTGCTCAATTTTTTATTATNGCTTAGNTTGGGTATTATTAGNTNGATTGTNNAGTTTGANGN 
NATACTGGNTGNCGCATTCAACCTCTGTNGAATNGAGTATTTAGGATGCCNAAGCCNTTATC 
TTTTTGACTCCCNGTTGGNATGNAATAAAANATGTCTGNTGATT 

SEQIDN0622 

ANTACANNTAAAGGTNTTAGCTGCTGACATTTNGAATTGTCGCTCAAGCTGNTGNTTGGATT 
GCTTGTCNCTGAAATTTGNATTTTTGAGTGTTCGAGTNCGATNNCAATTTCAGAAAGTGAAG 
CTACATTNTGTTGAATCTNCTATTG 

SEQIDN0623 

CTAGGCGTGTTAGTCGACAAAGCATAGCCCACGTTCTGTGTTTTTGGATCGCAGTTCATCGT 
CAAATTCTAGGCGTAGTTCTAGTGGTACTAGTTCGAAGCATCCGTACAGTAGCT 

SEQIDN0624 

AGAACCAATCCCCAAATTTTTGGGGTACCCACTCCACCTCCCGCTGCTGCTCCTTTGCCTTC 
AACTCATTTCAAGCATGACAACACTTCCAATGTGTAAAGCTTCGAGGCATACTCGCT 

SEQIDN0625 

CAATTAGCNTGTGCNAGNCANAANAGGGAAGAGAAGNAATNTTTGTATAGCTTCTTGACAAA 
TGTAGGTNTTAGTGATCCTTGNTATTTACTTAT 

SEQIDN0626 

TTGTAATGCTTTGTTATCCACCACTGGTGTCGAACAATGTTCAGTGTTTTCTTCTAATGGTT 
AGTTCAAGTTGTTGTGGATAAATGATTATACTGTGCTCTTCGTAAACATAGGATGCATTTGT 
ACCAT 

SEQIDN0627 

TGGNTGATATCATTATAGATATAGGGCTTCACTCCCTAATCNNTNTTTTTCCAAGGTNTACA 
CAANCCTGATTNTTCNNCT 

SEQIDN0628 

AGGTTGATGAAG7\AAATGAAAGACTAATAGTTGATGAAGTATGTGAAGCAATGAACAAGATC 
AATGTTTACAACCGATCATGAGTTTGAAGGAGTAGAAGAAGAGTGTGCTGAATTTGCATTTG 
CCTAAAGGAAACCACTTGCGTTTCCCCG7U\GATAACTGAATGAAAAACTTTGTTTTTTTTCC 
GCTTTCTGTGAAGACACCAATAGCTGAGGTGTTTTAGAAAGTATTACATTCTG 

SEQIDN0629 

NTTCNANCGAACNNTCCATGTGCTCATTNCATGCAATGCTGATGNNNAANNGTGTCCANNNG 
GCCGTTTACNCNTNGG 
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SEQIDNO630 

TGACCNAGGACCAANATTGAAGGAACATCAACAAANGACTTGTTACTATGAATCTTTNGCTT 
GNCGANTAGAGCTTATNTATTCTTATGATGNTGATGATGANGCTNTAGGCATNAAACTTCAT 
ACTAATATCTTTGNAATTGCATCTGGATGTTCAACTTCTAAGAGTTGTGATGGNCTTTAGAN 

TTTGAG 
SEQIDN0631 

TTTCTTCAAGANTGCCAAANNAAGCATGCAATGAGCAACGGTTGTCACACGACATATAGCAC 
TGTCAAGTTACTNACAAAAAGTGAGAAAAAGAAAAATGAGAGAGTCTTACTAGTGAAAACCT 
CCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAAATGAGAGAGACTTGTTGGTGAA 
AACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGCTGATGCGAAGAATTGGCATAAA 
CAAGCCTGACTTCAAAGGTCATAAGAATGGTATAAGGGGAAGATTGGATTAGTTTGGTAGAT 

CGGTCG 
SEQIDN0632 

GTGCAGGAGNTGGCCCAAAAGNANGGGAGNTGAATTTACTAATTCTGNTGNTGGC 
SEQIDN0633 

CCACNCCCCCTATTTTCCCCTATANGCCCNTTCTACATTGGCACNTTTCACAAACAAGNACG 
CTNACCCTTTNTTATGTNGGACTCTGTACNC 

SEQIDN0634 

GAGGCNTTNCATTTGANCTTCATTGNACCAACAACTTNACCACCATGGCACACTAGTTCCTT 
GNCGACGGGAAGCACCATGAAAAACGCTGTCCCTCACCACTAAAAGCTCACCGGAAAATAGT 
NGCCGGATAAGCTTCAGCACACCCAGGACCCTTCTCGCATCTCCTTCACACCAGCGACCCCT 

CCCCCCCGGNCG 
SEQIDN0635 

NNNACGNTCTCGAGTNTGNNGCCTTTCTCAAGACTGCCCAAANAAGCATGCNATGNGCAACG 
GTTGTCACACGACATATAGNACTGTCAAGTTACTTACAAAAAGTGAGAAAAGGAANAATGAG 
AGAGTCTTACTAGTGAAAACCTCCACGGGCACTGTAAGGCGACGGTAAGCAGAGATGAATAA 
ATGAGAGAGACTTGTTGGTGAAAACCCCTTGGGAACTACTTGTCGAAAGTGAGTCGTGAAGC 
TGATGCGAAGAATTGGCATAAACAAGCCTGACTTCAAAGGTCATAAGAATGGTATAAGGGGA 

AGATTGGATTAGTTTGGTAGATCGGTCG 
SEQIDN0636 

TCCCCAAANTCTGNTTGAATGAGNGNGCCCANACCAGGACNGCTTNGCCGCTAGACCCGGAC 
AN ACN T CT T T TC G AN AAACN CAT C GAN C AGGGC A 

SEQIDN0637 

TTTGGAAATCGCCCAAGACAATTTCTGGNATCGGGGAAGTTTGNAGAATNNATGCTATTGGC 
ATAANTCAGNAGTTTNNAGATNCGAANCTGCCANTAGACTCGCTAAAGCTGGCGCCTNACNT 

A 

SEQIDN0638 

TGCCCTAAAGCCGGGGAAAATCTNATTGGNGGCTGAAAATGAACCAAAAAAGCTGAAGACAA 
AAGGAATGATCAAAGAAAAGGTTCGTAAATTATATTGATACANCTCTAGACAGTCTCCA 

SEQIDN0639 
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TCAAAAGGCAAGCAACCCCTTTGGTGGGCATAAGGGTATAAATGCCG 
. SEQIDNO640 

NCTCTACACAGAAACTCGAAACCTACGCNTGACGGTCACGATTTCAGTANCCTTCCNNCTCC 
TGNGT 

SEQIDN0641 

TGNGGTGGGGGAGCTCGTCACCTGTCTATCAGGACCTTGNGTATACTGGCCAACCTGAAGCT 
ATGCAAATGTCACGACNCCTTAGTCT 

SEQIDN0642 

GATCCCTCNCTCAAATGCATTCTGATCAACTAAATTTGAAAGGCGAGGGCAATCGATGTTAT 
AGAAAGGGGTTTCGTTGGTGAATTTTCTTTGTTCATTTTGCGCAACAGCTTGTTGTCTTGAT 
AGTGAAGGAGTTTATTTTGTTTACAGAATATTAGTC 

SEQIDN0643 

CCTACATCACCAAAGCTATCATCTATGAGCTGGTGGAAGGATGGAAGCACTCCATGTTTCAC 
ACTGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACC 

CTATTAC 
SEQIDN064 4 

GAAGGCCNTTNCGTTNNACACCAATGAGCCCTTTTCTTCTAAAAAACAAAAACACATTCAAA 
AACCATCCTTAGCAGCAGCAAAAGACCTCTAAAAATAAGTTCAAACCAGCTTTTTCTTTCTC 
CCTAAATAGTATGAAACCCGTCCAAATAAGC 

SEQIDN0645 

AATCTTTTCACCATCGGCCGCAATAATCGCCTCTGCGGCACGTTCAATCTGGNGTGGGCTCA 
AGAACAACAAGTATTTGGTCTGCGGATACTGCGCTGCTGCTAGTTCTTCTTCGGTGGTTTCC 
TCAGAATCTTTCAGCGCCTTGATGCGCTTTGCATCCACTTCCGCC 

SEQIDN0646 

GGCTCATATCGATTATGGATCAGANATTACCGGAGAAGAAAGATTTTTACCTTTTTAGACTT 
ATACTAGGGATGAAACTCTNCTACTATATAAAGAGAAAGGTTTTCTTTTGNAACATATACTG 
G AAC AT G C AAAT C AAAG C AAT AG G AGT TTATTTTCTGCC 

SEQIDN0647 

GCNNTGGCNNATCCCACTNTATGGGCGGTAGCCAGGCGTATACCGAGGTCGGACAGATCACT 
TAGCGCTGNCGGGGGAAAAGGGCTTTGCATAACCCTNGCAGGACTCGTTTGNTTTACNCGCN 
TGNAGTNAGGACCTTTGNTGCGAGGNAGCCCGTAAAGCCGAGCAGCAAAGNCATATTCCTGA 
GCTGGTNAAATATTTCNGNCNGACNGGCCACGTNCC 

SEQIDN0648 

CCNGGAACCTATTGACTCGACCTCAATCAAAGAAAAGGGATGGTGATTTCGCTCCATTTCCA 
GGCTGNTTCCTGGTGTTCAAAGGGTACTTTTGAGTGGCGTTTCAGGNGGNCTTTTTTAGCAA • 
CGACACAACTATTTCGAACAGAGGTTTCAGCTGCGNTTCGAACAGTTTTGAGAGNGATTTCT 
GGNGGTTTNCGGGGCTAGAAGGATGCTGGTAGAGTTCTTGTCCGAGGTTTTGACATTTCAGA 
TTCATCGAGGTCTATTTCTTCCTTCCTCACGTTGTTTGTGC 

SEQIDN064 9 
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CNCATCACCAAAGCTATCATCTATGAGCTGGNGGAAGGATGGAAGCACTCCATGTTTCACAC 
TGATTGAATCACCCGTCCTACCAAAGCATTGATGTCTTCTTCTTTATGATCACAGGCACCCT 
ATTAC 

SEQIDNO650 

TCGTCTACGGANGATTGNTCAGGTACACGCTTCTGAAATTATGGATTGATGTACGTTTGAAT 
TGGAAGTTGAGTTGAAGTAAACAAAGNAAATGAATCGTTCACCTACTTTCACAATACCTGTG 
T TT C AAAT G T AG C AAT AG G A 

SEQIDN0651; 

CTACGGNNAACTCCTCATCTTNNCCCTTCTACTCCTTTGATGTCCAGAGCAACATTTTCCGG 
TGCCGGAATTGTGAAAGGGAGGTCAGCGCGAGCAGAATCACCAGCCATTGTGGCAATTTGGC 
ATAGTAAAAAGACAATGGAAAGGAAGGATGAAAGTTTTCGA 



SEQIDN0652 

CGAATGTCCTGATTGCACTGAAATGAAATGAAGAGGAAGCATATTTTTGTTGAAATTTCCGG 
TGGCTTCAATGCTNTCATTATAGNTTTGNAATAATTTTGGACTGNATTGAACTGATGAACTG 
TTAGGCTTGAGTTTGATCATTTGGACTA 

SEQIDN0653 

CTGGTGTCGAACAATGTTCAGNGTTTTCTTCTAATGGTTAGTTCAAGTTGTTGTGGATAAAT 
GATTATACTGTGCTNTTCNTAAACATAGGATGCATTTGTACCAT 

SEQIDN0654 

TTCTNCGGCAGAAGTCAAGCTATCTATCAAGTGCACTTGACCATGATAAGGCGACAATCCCG 
GAGGGTAACTCTAGAGGAGGTACATGCTCGCGGCTTTGATCTCTCAGCCGATATTGAAAGGA 
CGAAGATTTTGGAAGAAGAGGCTGCCACTCAGCTTTCTGATGAGGATGATTCAGCCAGTGGC 
TCTAAGAGTGGAGGAGACGAAGATGAAGTCCCCGAGGGTGAGGCTCTCGAAGATGCGGCTCC 
TAAAGATGAAACTGCTGAAAATATGACCCCGAAGTAGTTTTGGGTTTCCTTATTTTGTTTCT 
GTTCAAGTCTCCCTTATGTAAATATCTCCTA 

SEQIDN0655 

GNNTGCTCTTGATTTTTCTGAAAAATCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGTCAT 
ACCAAGGAGGTGAGGCTGAAAGTAAAGAGAATGACGACAATTCATCTATATGGTCAATTCAA 
GTGAATGCAAGTACTAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGA 
AGAAGAATATGATGATGATAACTATGATGAAAATGAAGAAGATGGAGATTTAGTTGATGAAC 
TGTGTGAAGCAATTAGCAAGA 

SEQIDN0656 

ATGAGGTGTTGGGTTACATCTCTATTTCCCTTTTTGTACCNTCCACGTGGACACTTCTTCTC 
CTTTAGTTTTGATTCTTTGTCTGCAATGCCCCTCTTTCC7\ACCTCTCAAATGCCTGGACAAC 
AGATAATCTCGTTCTTGTTTGNTGCGACAAATGTTGTTCATAAGTTGTGTTTATTATAAGAT 
ATTGAACATCATAGCTTCCACTTAGTTCTTTAGCTAATGTGAAAGTTGCTTATGG 

SEQIDN0657 

ACGTTGAGAGCCGTAAGCCAGAAACTGGAGAGGAAGATACAAATGCATCTGCCGGTTCAACT 
GGAGTTGATAGCATGGCTGATAGCATAAAATCATTCACTTGTAATCAGAATTTTACAGATAC 
TGAGGCTTGCACGTCAGCAATAGGTCTATCAGCTCATGATGATCAGGCATCAGATATTGCAG 
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ACCCTGAAGAAGCTGCTGTGACAGAATCAGCTGTAGTAAGTCAGGAATGTGCCTCTAATTTG 
G 

SEQIDN0658 

GGATGAGAGAAAGCCAAGTCGGACGGTTTGGTGAANCCAGAACTAATTCAGCAGATCGTTAT 
AGTGGACAGAGAAGCTGATTTTGAAAATGCTCTTCAGAATGGTGGGGGGAAGATAGCTCCTG 
GTGGTGT7\ATCAGTGTAAAATCCAACAAATTC7\AGCTTGAGAAGCATTTNGAGCCGGNGACT 
GAAAAGAGTGGCNAGAAAAAGAAACAAAAAACCATTCTGGA 

SEQIDN0659 

AGCCTGNCCTAAACCAGTNTTCGATCTNTGCTCTGCTGCCATTTGTNGAACCATTGGCACAG 
TGGAACTGAAAAGAAGAACGCGTCCATGCTGTCCTTGTCCAATCACTGTCCA 



SEQIDNO660 

AAAGCAACTGTTTNTTAGAGTNCATGGGTTTAGCCATGGCCCATNCTTNATTAGNCCNAAAC 
ACTCCCNAAGATATNGATATTGGNCACAACAAAGGCCCGTGCAGAAGATGGTGTGCCACTGC 
CACCA 

SEQIDN0661 

ACGGGGNNNTTGTCCCATTGACGTATCTCACAACTATTTTAANNGNCAAACCCGAAGTGGTA 
TGTGGTGTGGTCTGCAAATATGAACNCTCACATTCTTCCCGNGGTGCGTAGTTAGCTACAAA 
TATGGACGTCATATGTCAGGTCAAGCAAATNGTGCTTCATCCATGAAGTGGGCTCCTCATGC 
TTCAAATGCAATGGGNACA 

SEQIDN0662 

TTGAGAAAGTTTTGTTTTTAAGACNGGTTGCTNGGAAAGNATGGNNGTTGGCCA 
SEQIDN0663 

TTNNAATAGCCATACAAGGTATATCGGNGGTTANTGCATGTTTTTNAACTTATGGNNCACNC 
ANNATTGTTGTTGATCCANGGTCACAAANAGNCAAGCNGTCANGNTGNANGAGANAANTNAA 
NAATGGAGGCANATGTGGNGATGTANNTACCAGTTGTGAACAATAISIGACATGNACT'GTTCGN 
CATGATTGGCACNATTTGTGNGGNGAATCCNAAGCAA 

SEQIDN0664 

GANGACCCTATGCTGATGATCCCTATGCGTTTGGCTAGAGGTGAAGATGTCCCACTCCAGTG 
CAGAGCTTCCTAGAGAATCTGAAACTTTGACCTGGAAATGTGTGTGCGCTGATTCTTTGATT 
GCAGACGTATAGCTGGCTGCTTTCCACATTGCAAGGAACTAGAATTTTACTTCCCCCA7VAAA 
TAAAACTGTATATAACTGCAA 

SEQIDN0665 

CCCTATGCGNTTGGCTAGAGGTGAAGAATGTCCCACTCCANGGCAAAGCTNNCTAGAGAATC 
TGAAACTTTGACCTGG7\AATGTGTGTGCGCTCNACTTTGATTGCNNTACGTATAGCTGGCTG 
CTTTCCACATNGNNAGGAACTAGAATTTTACTTCCCCCAAAAATAAAACTGNATATAACTGN 
NATTACTCAGGACTCATNATCCTCCTGCTCAAGTTGCTCAAGTTCCTGGAGCAG71AGTGATC 
CCTGCTCCAGCTCCTACTGGCTGGGAATGAGACCTGCTTCCTTTAGAAAGTTCTTTTTGA 

SEQIDN0666 
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GANNGNCGTANACGAAGNCAGGGGACTGAATCATNAAGTATGCACAACGGAGCTCTATTTGT 
TNGTTCCACCNTGTGTTGGGNGGGNGGAGTGGCTNCCTANTGATATGTATGTATNNTNNGAG 
CCAAAGNTCATATTATACTTAANCCTACTGNGCNCCTATAAAGAGAATGCCGCGAGATTCAG 
AAGATGCTTCTGATCTGTGA 

SEQIDN0667 

GGAGGCTAATAAGTTGAAGGCATTGCAGAGAGCTGCTGCTCGAACCTCTCATATCAAGTCTA 
CGTGATGGTTTTCACATAGAGCTCCATAGAGGTTTCTAACTAATTATATCCTTTCTTATTGT 
AAATGCTTCAGATTACCTTCAATCTTGAACGTCCAGAGACTTGTCCAAATGATAAATCTTTT 
TACTCTTTCACCCAAATTGGATGTCATTTTCA 

SEQIDNO668 

AATCTGAAGGGTCAGAAGAATCATCAGTGTGTTCCTCTGTGGTGACATACCAAGGAGGTGAG 
GCTGAAAGTAAAGAGAATGACGACAATTCATCTATGTGGTCGATTCAAGTGAATGCAAGTAC 
TAAAGATGATGAAGAAGATGAGGAAGAAGGAGGACTTGAAGAAGAAGAAGAAGAATATGGAG 
ATTTAGTTGATGAACTGTGTGAAGCAATTAGCAAGA 

SEQIDN0669 

GCCANCCCAGTCGACAAGACCAGCGCCTGNACGTAAAAATCTGATACCTGACTAAGCTTATG 
TCCTGAGGGAGCCAACCTCCCTCAGGCGTCTGTTACTACCTGCTGGCTT 

SEQIDNO670 

GCCGGCTCTGNGTCCACCTGACTATCAGAAGCGGCNCAGATGATTGCATCTGTATTANAAAC 
AANGGAATCTCCATCTTCCATGANTGNGCCTATAGACATCTCTCTATAANTCATTTTTTTTN 
CTTNNNCANAAATNGNCGGAGATACTNTAGCTTCATNANTNGT 

SEQIDN0671 

GGGCAAGTGGATGGTGGGTACTGNCNCGTTCGGAGCTCGAAGGTTTCTGNNNCTGGATTGNC 
TGTCTATACCATTATGTGATGTNACCNAGATGGCATCGCATCTTGAGGCCCACTCTCATCTN 
GCTTNTG 

SEQIDN0672 

GGNGCNATTGCCNAANTGTGCTTCTTGCTGGATATCATGTGTGAGTGTTATCTTCAAGAACC 
TCACA7\ATTTGTAGTTGATCAGAATCTTTGCAATGCGTTTTCTCATTTTCTTTCATTTGTGC 
TTCCTTTATTTTGTCTTTTACG 

SEQIDN067 3 

GGTGCTGAATTGGAGGAAGGAGAANAGGANNNGGANGAGGAATGCCTAGNNGNNNGNGTGCA 
TAGANTCCAACTGAGTCACGCAAGAAACCAGTNTGTTCCACTGNTTGGCTTNCTGCTAGGGN 
TGTTGAGTCTTTGAATAAGAACGTTGATGGN 

SEQIDN0674 

CACCATTCTTGATCGTAGTCCGAGATTCCACGGTGAGCTGCTCCCTTCCTATGTCGTTCAGC 
AGCATGATGGAGTCTCTCTTTGCTTTTGGTTGTCTATTCTATTTCAGACAGTTGGATAGATT ■ 
TATTCTTTTATATATTCTGCTAGATGCCCATATACTTGTGACACCAGGTCTTGACACACACA 
TTAGTAGACTATTCTTTTGGGATTGTATAATTATTATTGTACGTTGCTAATTATCACTTGGT 

SEQIDN0675 

GGGGGNNGNTTNTCTCTCCGCTGGAAANNTGANTGACTTGGGTGCTAANTGATGGNAGACCN 
ACACACCCAANAAGGGNAAGNGGAAAGGACGACATGGNTCAATAGCNCAGNGAGGGAGACAG 
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ACGGAATGAAACGANNCAAGANANTGGGGNNACCNTGTTCTATTTANTGTGNNAGNNNAAAC 
AACCCACGTTCCTNACAAAACAAACAGTATTTTGGATCGGAGACTAATCTGAATTTTCCAGA 
CGAGTTTTTTNCGGTNAATCTNGAGGTTCCGACATGGNTTTTTG 

SEQIDN0676 

TAGGGAANCNATNCTCATTTGTTATGACCACCATTTACTTAGCT 
SEQIDN0677 

CTCNGNTANCAACACGGCTGGATAAACTTCAGNGCTCCCGGTGTGGGTCTATTTATCGGAGT 
TTGAGCACGACNNACACCCCGGGACCATNTAGNTAGGATNGCTCATTCANGAATAGC 



SEQIDN0678 

TCAGAATGCGAATTTGCCTACTCAAATGAACGAGATTCCTGCTAAGTGGAATGGCAATCCGG 
AAGGTTGTAGTTTTGTTCGTCCAAGCTCTTTCTCGGCTTCCTCATCACCTGCAGGTCCTTTT 
AGATCATCATCTTTGTATTATTCTGCCGGCTTTTCATAACCAAGAATGTTGCCTTGCATGGG 
CATTTACTCTCATGACAGACAATAGAAACCTGACGCTTACAAAGCATAAATATAGCAGTCTG 
AACGAAAACACACACGGCAAGTTTGAGCAGATGAGTTATTCTAGATTTGCAGGTTTTGCT 

SEQIDN0679 

TTTGGCCATACAAAGGGNTGAATATGAGGNATATGGGGGGNTAGGCATATGTCGCACAAACC 
CTGGNAT 

SEQIDNO680 

ACCCTACCGGGAGGATCATATGAGCGTGGGTTCTACTGGCCTCGACGTCCTCTGTAGTTGGA 
AGGGAAACCAT 

SEQIDN0681 
GGTGTTTTAGGTTGTCT 

SEQIDN0682 

TNGCCCCNGCCAGTCGGACAGAANCGGNTAGNACCGAAGNCNATNCTGCCACGGGCANGGAA 
GACGT 

SEQIDN0683 

AGAGGTGGTGGGACTGTTCGTTCGGTGCTCGAGGTTTCTGGTTCTGATTTCTGTCTATACCA 
TTATTGTTGTAACCGAGATGGCATCGCATCTTGAGGTCCACTCTCATCTTGCTTATG 

SEQIDN0684 

GCNGNGNNCAAGGNGGCTACCTGACNTNACTNAATAAATCAANCTNTTTGAACTCAGGGTNT 
ATAGGANGAGATGGAGGCTCATGCATGGTTGACACCAGGGTTACTGGAAAGANGGTTTATCA 
TCCAAACCATAACATTGACACTGAGGATGATGCACTTGCGCTGAAGTTGTCATCAACCACAA 
CCATTGCTTCAGATAATACGAGCTCATTATCTAATGAGGAATCAGCAAACTTAGCAAGTGTT 
ACTTCACTTTCTG 

SEQIDN0685 
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CGTTACATATTAGGAGTATAATTTTTTCATTACTAAAGCATGTAAATATGTTGCTCCGGGCT 
TTGGTCTATTAGTAAGAGCGCAATGCGTGATATGTGGG 

SEQIDNO686 

TCNAGCAATTANNNNNTTTGGCCTGCNGGTNCCTNTGGCGCTGANGATCTCTATGCCCCGCC 
GGCAGACGGTGGATTGGATGATGACAATGCTCACG 

SEQIDN0687 

TGGNNNNTCCTNNCNNGCCAATAACCAGCCCCNGGNGCTATCANCATAANCTAAAAAGANCC 
CCATACANTCAACCTGGCTGGNCCATCACTTAGGGCNNNGTTTCAAGATTATCCAACTTGGG 
NAATACTTATCCGCCANGATCNATAGCCGGATCAGACNGACG 



SEQIDN0688 

AAGACAGGGATGGCAGTGCTGAGAGGAGGGCAAAGATTGAGCAATGGAATAGGGAAAAAGAA 
GAGGNAGAATCTGCTAAATACAATAATTTTGACACTGATAATGGCAAGAGTGATGGTGGTGA 
TCACTATGGAGAACAGTTTGATGACGATTACCCGAAGCAGCAGTAGGTAGCAAATGGGAAGT 

TATGGGCTACTGATAGTAGTGGTTACTCTGG 

SEQIDN068 9 mM 
NAANCCCAGNANNATTCNNGANGCAAGGGTTGATAGCGACTATCANGGCTGATGATTTTTCA 

CCGNGCTTNGGCGGGAGTAGCCTGTGCTCATTGACNGGAACCCGTNTCGCAGGACCTTCGCC 
ATGAATCGNTTTCTCGCCATTTCCGTATTGCTCGTCANCTCAGTCCTTGCCGGTTGCGCGAC 
ACATTCGNCGCCTGAACTGCGTGCCTACTCGGCGGAAGAGAGCAAGGAGCTGGCGCTGGAAG 
CCCTGAGCCGTCGAGGCCTGTCGTTTGATGAATACCAACAGAAGAAAGCCGAACTGACCGGC 
CAGCCACAAAAAACCTTTGGTTTCGACCGCAGGGTGAAATGAATGNCGAGCGCGGNATGACG 
CTCCACGGCGCCCAGGTGAGTTAAGTGACAGGGCNTGAAAAGCCGAGGGTTCCACANGAACC 
TCGGGTTTTTGNTTTGCCATCCCGTTTCCGGAGCCTG 

SEQIDNO690 

CCATNANTTNACANTGCTGGNNCATNNACAACCCGGTGCGCGGTTCGCCGTTGCGCGGCAGT 
TCCGGC 

SEQIDN0691 _ m 
CGGTACGAGAAGCGTGTGATTCAAAAACAAGTGTGATCATGCAAAGTATTGAGATGGAATCT 

TGGAATGCATGGAACTAGCGTTAGATTTGGTTGAAATTTGTAATTCTAATCGCAAGC 

SEQIDN0692 m «„ m „» 
GGGCCTCCTAGCAACATTTAGGAACCGAATAACAGCACTTCTCAGTCTATACGGCATCCTGA 

TTTGTTCATCAGCTCGTATTTCACAGGCTACCATATCACCAGTGTTCCATGCTCAGCC 
SEQIDN0693 

CCTCNGGCAGGTACTCAATAGCNAACAACTTTTACATCCTCAAATTAGCACAAATCTACATA 
TTTCATATACAGAACACTATAGTAGAGTTCATGTTTAGACTATTGCCAAGTCTGCATGATCT 

AAACAACAACTTCCACC 
SEQIDN0694 
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.TTTCTTGTTGCTCGTGAAGAGCCAATAACCAGCCCCCGCAGCTATCAACATAATCAAAAAAG 
AACCACCATAGATCAACCTGGTAGGCCCATCACTTAGGGCAACTTTCAGATTATCCAACTTC 
GAAATCTTATCCGCCATAATCAATAGCCGCTCAGACTGACG 

SEQIDN0695 

TGGTCGNTGNAANAATTTTGCTGGAAGCTTTGTNNAATGAAAAATTGNTGCTTCAG 
SEQIDN0696 

GCCAGCTAAGTGGCTTTATAACACCAAAAGAAAGAGGCCTTAGGACAACTAAATATGACATA 
CACTTAGACAACATGAATTTGCCAATTTATCTGTTACTATTTCCATTGACCTCTAAACTCAC 
CTCCATGCA 

SEQIDN0697 

NCGTAGCTATCTTTGCTGCTTCTTTGATGCTTTGAATCATCTTTGATCTGTGACGATATTTT 
GTGTTTTATTTCGCCGGAGTTGAACAGTTAGGAGTTTATTTATGGNTTTATTTTTCACTGTT 
TTTTGTTCATTCTTTTTTTTACTTCTTGACA 

SEQIDN0698 

GATGCATGTGTCACAGAAGAGATGCCATAGTTCCATATTAGGAATTGATAAGATGTGCTAAG 
ATCAATATAGGTCACTTAGTATTATTCTCTTCTAGGCACTAGTTTCAGGTGATATTTTAGTT 
TTATGGGATGCATTTCGTAAACTTGTTCTTGCCTTTCAGTTTCATTTTGTATGTATATGTCA 
CTGGTCCATATTGTTGTTGACACTCGGCA 

SEQIDN0699 

GATGGACGTGTTATTGGTGGTGGAGTTGCCGGNCTATTGGTAGGCTGNCAGTCCTGTGCAGA 
TTGTTGNGGGCAGCTTNCTTGATGGAATTCAGCTCGAGCAGACGACCAAGANAAACAAGTNC 
GAGCCCATAGNTGNAGCTGNTCCTCTATCTAGTACAGATATGGAAANCGCCTATNACTCATN 
ATNAGCANAACCAACTGTAGNATCGGATTCTTCCTTACATGAAGATAACTGNNCATNATTAG 
CCCNCGACTTGAGGAATANNCCTGCTGACATCAATGNATNTAANCCTGCATAGGTTTTTGTN 
GAANTGNANTTNATCNG 

SEQIDNO700 

NCTG7y\GAAGGTCCTNTCGGGANGAAATAGCTAGGNNGTCTTNGNTTCANCT 
SEQIDNO701 

NCTGATTGTTCTTACAAATAGGTCAATCTTAGTCCAAGTAAGTATATTCTCTTACTTCTGTA 
NTTTTCCAGATTTGGT 

SEQIDNO702 

TAAGGGATCACGACCCTACGGGAGATCATATGAGCGTGGGTTCTACTGNCTCGACGGGCTNT 
GNAGNNGGANGGAAACCAT 

SEQIDNO703 

GAANGAAGGTTCCAAGNGNCTCCCATTGTGGAGCANTATCACCTACACATTGTAGGGCT7^AT 
TATCTTTTCACTTCACNCGGTAGAGGANCAGATTGCATAGCTGT 

SEQIDNO704 

NNTGATGTCCCTTCCTTTCTGGTGTCGTATCCGGCTTTTTNCGTGGAAGCGGTGTTGCTAAA 
TCGNGTGTCCGACGGCCCTTTACTGTACTGGAACGACATTTCTCATTTTGTTGCTGCTGTTT 
ACGGT 
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SEQIDNO705 

GATATCTTCATCTTTGCGCTTTATGTTCTTCACATCCACAAGTATTGGTGTGTTTTCTGCAT 
TATCATTTCTCAGTAGTTTCCTTCTCTGTTTCTCC 

SEQIDNO7 0 6 

CGAAGAAGAAGANACTTACG 
SEQIDNO707 

CACGGAAATCACGCCGNCNTTGGTACCTTGACCGGGTTNCCTANAGGGNACTTCAGTCANTG 
GGNNGCCCAGNNACTGAGGNGGCCG 

SEQIDNO7 08 

CATCTGGATTNAACAATTTCATGGCCAGGTTTTCAAAAAAATAAAACAAGGTCTTCATGGCC 
GTGC 

SEQIDNO709 

NAACCTTACTGTACAAAGGAAATCATTGGTTGCTTGGGATAAAGTCTGCATGCCCAAAACTG 
AGGTGGCC 

SEQIDNO710 

CGGNNTTTTGACAAAGGTTCCCGCTTACACACTCCTCGTNCGATGNGCTCCCTGACCCGAGT 
GTTNTCGCGCAGCAGTGTCATGNTCAAAACCAGGATTGNNTTNAAAANGACAGGACTTCAGG 

TCATTNATTCCGCC 
SEQIDN0711 

ATTNTNNTTTTTGGAATGGTAAATACAGGTTGGATAGAAGCTTTCCCA 
SEQIDN0712 

GCAAATANTTATANGAAAAGGTCAAGGAAACACTAAGTGTGTCATAAATAGGATTATCTATT 
ANT A 

SEQIDN0713 

CTNNCTTTGNTNGACGAGAGTAANANCTTGGCAGCTATCTTCCAAGCCATTTTCAAGGGCTN 
TGCATCTGTAGTNCTNTGCA 

SEQIDN0714 

CTGATAGTCTGATGGGCTTCCCTTTGAGGGTAACCCGACCTTTCTCTCTGGCTGCCC 



SEQIDN0715 

GANCCAGCGGNANTAGCTGCTGTACTANNNACAGGNATCCAANATATGAAAGCT 
SEQIDN0716 

AGCACNTCCGGCTGTATCTTACTACCAGAGAAATTACAGNTGTGGACATATCTCGAAGATGA 
ATCAATNGAATATATCTNCTAATGAAATTGTCTTGCTCTTTNGTTGNGTAT 

SEQIDN0717 

ACTGGTCCAAAAGCTNCAAAAATTTGTCTAAGCTGTACTNTGNCATGNNGAAATGNAGATTT 
CCNACATAAAGTTTTCTCTCTGAGGCAGCATTTGNGCCTGCCAACCCTGANNCACCACCANA 
CGCAGTTGACTGAACAAGGGTTTTTTCAAGCTTCAGAANGCNTTACCAATNNTGGGTNGNCC 
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AAAAANCAAGGCANCGGGTAAANGAATTGGCCATNGGNCCAANCTTNGNNTATAAAANNNNA 
ANGTCCCNAANTCNTTTANATNGCNTNGAATNCCGGCCNNTGA 

SEQIDN0718 

GATTTAGTGATNAATTTCCAGCTTATTTTTTGNTGTGAGAGGAGNGCAGTATCAGNACTCCT 
TCTGGCGCCAGGATACCATNAACAGGTAGCCATCGAAGGTGTACA 

SEQIDN0719 

GGGAAGNTCCAAACAAAAAAGAAAAACGCAGTAATACCCTCCAAAAAGCTTCATCTTTCTCA 
CCAAAGCCTCTTTGCTTTGGCCATAGAAACCAGTAACCATTAGCTATGTAAAACCATTGCAG 
CTACCATTTTAGAAACAGTTTCGAAACGCCA 



SEQIDNO720 

AGCCCTTNGTCAGCCCACCTNTTATGCTCAATCNCACCGNNAGGAANNCTGNNAGAGTTANN 
GANGCGATTGATTNCNGCNCTGACAGATCATATNGCTTCTATAANNGTTGNGCGGACACGCG 
AATNAGNTTNCTTACCCTCGCATAAGACANATNCTGATCTTACCAACCACTCATTAGATGTG 
GNACCTACAGCANCTACATCTTCTACTGCTGCTAACA 

SEQIDN0721 

AGGCATTNTNCAGNGCGCCACAAGTATACTGGATTTCCCGGAGACATGTGACTGGAGANGCA 
TCACCGCAAGATTTGTCCGCTCAAACTCTCATTGATGCTGCCATTGNCATACAAAAGTGNAT 
TCAGCGGGTGGATAGTAAGGTCTTCTCTTGTAAGCACGGACCAATCTCCTACAGTTCCTAAG 
GAATGTGAAGAGAATATAAATGCAGCGGNAGCAATCCAACATGCTTCAAAGGAATATACA 

SEQIDN0722 

G G AAAC AT AC AAC G AAT G C C AAAAT C T G C CAT T T T G A 
SEQIDN0723 

CTTTCCG7VATGCTACCNGAT1SIGTATCAATTGGGGTGAACTGGTTGGGGTTTTTTTCCCCCTT 
TACC 

SEQIDN0724 

TCNTTGCNGANATCTACCTTACATGTTCCTGATGCAATCATGACTTACTCTGATTTACACAT 
GGGTTGCTGNGGGCTGATTCCATGTCC 

SEQIDN0725 

GCCTTAAATGGTGTGTTCTAACAGGCTTATGGGTATGCTGGCATTCTCCATTGCTGGGCATA 
CCCACAGCCTGCCCTTGCCTCTTTCAATTTTCCTATTCCCCC 

SEQIDN0726 

GCGNTACTTCANAGTCNNGGANAGAGGCTAAGAGGNCNNACANNAANTGCTTCAGTACTAAT 
GAANCANATNCTNGNNTCTTTTTNAGGGACATANCAGGTTTTGACAAGCCCCCACATGAATA 
AGAATATATNANACTTCTCTAACC 

SEQIDN0727 
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TTACTGTTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACA 
ACAAGCTTTTGGTTACTTTTTGTNAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATG 

CTATGGTAGTGGTCGTATCTCGAAACACGTGATATTTAGTGC 
SEQIDN0728 

GCCTTGGTAAGACATTCGTGAAAAAACTCTGTTATTTCTTAGAGATAAGGTGGTTCCCGC 
SEQIDN0729 

GACTCTTGCCAAAATTGTATCTAAATCCTCATCTTCCTTTGGGATTGGCCAAGATTGGCTGG 
CAATGTTGGGGCATTTTTGTTCGAGTTGTTCATGTTGGACAAGTGACTACCTGATTTAGATG 
TTGCAGAGCAAAGCTGTGCGATTGTGTTGTATGTTATTCTTCTA 



SEQIDNO730 

GCCCGACGCTTGAAAGATTGCATTTGGAGAAATGCCAATTGAGAAATAAGGAGAGCTTGAGA 
ACATTGTTTCTACTCTGTCAAGACGTCAGAGAGGTTATTTTCCAGAACTGTTGGGGACTGGA 
TAATGAAATGTTCAGCCTTGCCAGNGTTCTAAGGAGAGTGAAGTCCCTTTGCCTGGAAAGCT 
GTTCACTACTCACAACTGAAGNCCTTGAGTCTGTCCTCCTTTCATGGAAGGAAATCCAGAGC 
CTCAAGGTGATTTCATGTGGCAATATAAAGGATAGTGAAATCAGTCTAGCACTGTCTACCTT 
GTTCTCCGCACTAAAAGATTTACAATGGAGACCAGACTCAAAATCTCTTCTTTCAGCTGGTG 
TTGNGGGAACTTGCATGCGGAAAAGAGGCAATNAAATTTTTCAAAGAAGACCGTGNGACTTG 
GAAGTCACTTGCCTGGGNGCATAGACTGGCCTTCCTCATCATGCATTCAGGACAACTCTACT 

ATAT 

SEQIDN0731 

TAGGTTTTGTTTAGTGTTTTCTAAGTTCTTGTTTT 
SEQIDN07 32 

GAAGCNACTAGTTCAAAATATGTGCAGTGTTGATCATATTCTTTTGTTATGGCCAGTTTTTA 
CCATTTGTTGGACACGTTTGATGCTGT 

SEQIDNQ7 33 

GCCCGTANNANGGGTTCCNACACCNNCNTANGGTCCTNTTTTCCTTCTGAATNGAGCCTGCG 
ATAAACTCCATANANAACTAGCAAAAAGAGCTCCATTTTTTCACTAAAAACAACCGTTCAAA 
CAGCTATGANAATCCCTCTATCTCCATCAAAACCGCAGCATCCATCATCCTCAATAAAGGGC 
TGCACAAACCTGCTACAATCAGCATAAAAACAGCCCTGAAACTAGCTTCTTTCGAGCTAAAA 

TCAACT 
SEQIDN07 34 

TGTGCTAANGTAGCCCGNTCTTATCAATAAGTGCAAAGTTTGG 
SEQIDN0735 

TAATAAAGCCCCGGGANAAGNNAAGAAAAAAAAGAGAAAAAGAAACTAGGCCGGGTCAAGGC 
AGGCCATATTGNNAGCACTACTGCCTGG 

SEQIDN0736 
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TGNGACCTTTTGAATCTCCCGAGTCTGNAGGTCTAGTTTACTCCCAATAGACGAGTATCACT 
ACAAGTCTACTGCAAATGGTTGATGTTTGATGTGGGAGACGAAACGATAAGCAATTTAGTAA 
CATGTGTCCTTTTTCACGTATATATAGATAGAGCAAGAATGAAAATGGAGACACCTTTTCCA 
TTTTTGAAGGATATATTGCTGTTTCTTCCCTCAAAGAGAGTTTTGTGCACTATGTTTGGTAG 
CTTTTCGAGAGTAGTATGTTTTTATCTCGTTGAAGCAACCTCCTTTTTTCCCCCTTGACTAG 

TTGACTTGAAGGG 
SEQIDN0737 

TGTTTGTTTATGACCCTGCTGGGTCATTGGTAATTATGTGTTTAGTACTATGTCTTGGTGC 
SEQIDN07 38 

CTTTCGTCGGAGCTTTNGCCGCCGCCGGCTACCATCAGTACAATCCTCCGNGCTGGGCGCCC 
SEQIDN07 3 9 

TGAATCACCTTTTAAAACCACGGGNAAAAGTAAAAGTAAAAAAAAGNAGGAAAAAGGAAACT 
AGGCCGGGTCAAGGGCAGGCCATATTGACAGCACTACTGCCTCG 

SEQIDNO7 4 0 

TGCGAGACATTGCAACTAAGCAAGCTCTTTCCCTACATTGNCGTATCCCAGCACACAGATAT 
CACGGGGCATGGAGCCATCCNNCAGTGTCAACCAGTGCGCTATATAGGCGGNGACATGCGGC 

GCG 

SEQIDN07 41 

GNNNGNGGAGNAAGCAAGCATAGAAGGAGCAAANTGTTCATTCACTGTGAGTANGAAGACAA 
AGCAAGAAATAATTCAGAAGCTGATTGAAATAGTAAATGAAATATCAAGCA 

SEQIDN07 42 

CCCTGCCTGGGAAATGGTCAATTTGAGGAAGGGCATTGGCAGCTAACTTGTTATATGCGCAA 
AGTCTTGTATGACATAGAAGTAGATGGCAACAGACAACAAGTTCCACCAGATGATTCCAAGG 
TTCAAACTCAAGAATCAAGGTGTTTTTATGAAGGAAATCAACATAATGACAATGATGGCTAC 
TGGGACTATAACTTCTTATTTGGAGGTGCAGGTGGAGGAGAACATAG 

SEQIDN07 4 3 

CCTCAGTCTGAAAATTCCAACACCAATATGCCCCAATTTGATTCTAGCTTGACCCGTAACAA 
TATTGGATCAACCCCATATTATGGAAGTCATGAAAACATGACATCAACTAATTACCATATGG 
NGANTTATCATAATATGGTGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGT 
TCAATAGATTCTTATGACATGCAAACAGATCATCACAACAGTCGATGGACATTTCAAGATGA 
TGGAGATGACCTTCAGTCAGTGGCTTTCAGATATCTTCAACATTCTTGATCAGTANNTAGGN 

C T T C AAAAAC AAAT CAT GGGT G AAG A 
SEQIDN0744 

TNCTNTCATGGTGNGCCTACATTCNGGACACNGTANTGATCCTNGCCAGCANGATTGTCTTA 
CGCTACTACANTTGGANCGATNNGCCTTACCTGNCGGTTTTANTNNGAGGACAATAAGNTCG 
ACCNTCCNATCTGCCTGAGCATTNNNNCTATGATGANCGATNGGGAGGNCATTGTGCCATCT 
GCGAGTTGAANGATTATCCACAGTGAGAGCCGGAAACCCCTGCAATNCNANANTCTGGGT 

SEQIDN07 45 

CCTTNNGTGGCTNGNGNTGTGCTCTGCGT 
SEQIDN07 4 6 
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GNTTGTCCTACCTGGTGATGGATAGTTTGGGTTCTGAATAATTTGTGGGATGCAACAACAAG 
CTTTTGGTTACTTTTTGTCAAGTACAGTGGTTACTTGAACTAGTTGTGTAATATATGCTATG 

GTAGTGGTCGTATCTCGAAACACGTGATATTTAGTGC 

NCTTTG^TTTGAACCACTACCTAATATGAAAGAATGCCTGCTCGTAATGAAATACTTGTCA 

tgSgtctctaccgagtcctttggctagggcaactcaatcaatatgcagtcgtaagaa^ 

TTGAAATGCATATGTAGTCATCATCGGTGTTTTCACATTTATGTGAATTTGGATGTTCG 

cc?gc??gaggtccattcttttttctcctttnttttagttcgataacactatatgcgggtct 
ctgatggttgtcgcgtntttttgggtgc 

?t?ggaatacaattcaacttctgtttcctaaagaaatagaagcaagaaaagcagctggagct 

TTGAATAGTAG^GA^G^TCGACGCAAAAGTCCAGTAAGAGCTGCTACAGCTCATTCTAACAT 
CTCTAGCAGGAGAATATCAAGAGTGTTCGCGC 

CAGTATCCCCCTTACTTGTGTCAAATCANCTTNTCCCAGTATGGCTTCCATATTTTGACTAC 
AATTCTTATCAGAAGGCATGATAGTAATAAGTGACAAAGATGCAAAAAACATAAAAGTTGTC 

CTTCACTTTTGGTTAGAGGCTGAAGATGAACTTTCTAAGTTGGACA 
SEQIDN0751 

ttcgatcggtgaagcttctttaccaac 

?aSaagnaatgcngtnccaaaatacattgaaataattggcagccgaatactaaacttgatc 

ATGT 

?c?gaa?Jt?gtccgccaaattgtcgtgcataggaacagaacgagagccatcaatgccgtag 

GCGCCTTTCGCGTACCACATGACCCGAGAAAAAACACCGGAAAGGATTTCCGTGATTTGTTT 
CTCGGTGTAGCTACGCCGCA 

TTGGTNTTGNCACCTGCNAATGGCNNTACATGGAGCAGGGACGNNAATAAGTGGNACGAGTG 
ACCACATGAGGGAG 

CATCTCNTCCTCACTTCTTGAACTGTACGCCCACCCTTTTTCTTCTTGGNTNTGTTCTTANA 
AGTTTCTGGCACCTGCTTTTTGCTTCTATTATCATQAGCTTCTTCAGGA 

naSccaatatgccccaatttgattctagcttgacntgtaacaatattggatcaaccccata 

TTATGGAAGTCATGAAA^CATGACATCAACTAATTACCATATGGAGANTTATCATAATATG^ 
TGCTTCCCAAGGAAAATATGTCAAATTTTGAAGAGGGTTCTTGTTCAATNGATTCTTATGAC 
ATGCAAACAGATCATCACAACAGTCGATGGACATTTCNAGATGATGGAGATGACCTTNAGTC 
AGTGNNTTTCAGATATCTTCAACNTTCTTGATCNGTNNCTATGNCTTNAAAAACATATCATG 

GNTGATGA 
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SEQIDN0757 

CTTTGGGGCCGTTCTTGGNATCCGTCGAACTAGGGTGTTGAAATTTCTNTTTTTTCTTCTTT 
ATTGGGTTCTATTATCGATTNCATGNGATATTTTATTTCCTTATTTGTGTTTGAGTAATNGT 
TTTCCATGTTTGCTTGTTCGATTTCTACCACTATATAACCCCTCCCCAATTACCCTTTTGGA 

CAGACC 
SEQIDN0758 

GGTANCTCTNGGNCTGCGAANANGNCTCTNAGCCTTNCNCAAGCGNGCGCGAGAGAAGCGGC 
NNACNNAGCTACCGNTTCACCCGNCCGACTAAAANACAACAGNCGCAGACCTACTTTGATTC 
ANAAGAAAGGNGACGGNTTCGCNAACANGNANNCGGNTTTCTATCANAGGTGCNAGGGTTCC 

AAACC 

SEQIDN07 59 

CCTNTGGNGTTCTGNNAATTCTTGTACACANAAGGGCAAAACAAACAAAGGAAGAGCAGCAA 
AGTATGAGTAGAGCTTCAGTAGTACTAGTAGCTATTATGGTNGTGGAA 

SEQIDNO7 60 

GNGGCATTCGGANCGATGGATTGGTCTTCATAACATTCATCATCTTTACATTGCAGCATTTC 
AGAAG 

SEQIDN07 61 

TCAAAANTANTNNCNTNCTNGNNCTGCACATTGAGCATGTGCTCANCAACCTNTNTTGTGCT 
CNNTNTTCCCCTGAACATAGNAGTATGCAG 

SEQIDN0762 

TAGNNCCTGAGACNNAGNAAGAAGACAGACNGTCACTGCAACGCCNNANGNGAGCATGACNN 
GANCNGNGGNAC 

SEQIDN0763 

GGCACAAGTNNAANNGCCTGTNTCGAAGGTGNGGCAACAACC 
SEQIDN07 64 

CAACGTAAAGGATTCAATTCTTGTTTTGTTTGTTCATCATTGAAATAATTTTTTTTTAGTCT 
TGCATTATATGTTTGGTTGGT 

SEQIDN07 65 

GGCTTGGNGGNNGCGGGTGNCCACCATGNNATGCATACANTATNCATGTANGNNGCTACANA 
GACACATTNGGAATAATGNGTCGGATCGNTTAGNNNTGGG 

SEQIDN07 66 

CNCGATTNNATACAACCCTGAGAAAAGAATGTTAAAAAATGACTATCTTTTGTAAAGAAACC 
CCTTTCATTTCCAGGCAATGCAAGGGGGATCACAGTTTTACATNGTGGGTGTGGTTATTTTA 

CGTCACAGTT 
SEQIDN07 67 

ACGATCGATNANGTGGNCTNGNAACATTCANCATACTTTACATNGANATNTCANAGGTTACN 
CAGGNCTCATCANTGGNNNAGCCTNTGCTCANCG 

SEQIDN07 68 
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TCGCACACAGTATCATGAGAATNNTGGNCTTGTCATCCTCAAAAGAATCCTGTNANAGCATG 
G 

NT CT ACAATN GC AT ACAN CAT CAAGCAT AGNC AATCACAAACAT GT C AT GTAN AAGTCCT G A 
AATTTCGATGTCAGGACTAAGCTATAAGNACTACTACATGGAAAGCATATATGTGCATTCGT 

NGTCCAAGCAT 

SEQIDNO7 7 0 ~r-r*n 
GAGCCTGCTGGATCTTCTTTCTCTTAGCAAAGAGGAAAGGAAGAAACTAGTCGAAGAGCGCC 

CTGGAATCAATAATTCTACTATTACTGCTCTCATTTCTCTAAAATGGAAGGAATTGAGTGAA 

GAAGAAAAACAAGTGTGGAACAACAAAGCAGCTGAAGCATACAAAAAGGAAATGGAAGAGTA 

CAACAAATCTGTAGCAGAAAAGCAGAACAACAATTAGAAATAGTAGAAATAACTATAATATG 

TTCAACTGATTATGTTGAACATAGAATGATTGCTAGTTAGTTGAAGTAGTAAATAGGTATCA 

TTCCAATTTCCTTTGTTGTTTAGTAGCAG 



TCCGNTGCAANCGGNNCTTNCACNCTTAGCAANAACACNNTNCTGGGGATTNNAGTCATGCC 
ACAANTAGCAGGGGCTNAGNCGNCC 

GGTTCTNCTNTNNCTGCTGCGCCTGACAGCANTTGTGTGGNTCTGNCGCTGCACNCNNCNGC 
TGTNTACGC^GGAGGNGNAAANGGNTGNNCCTGNTNNGGAGTCACATGATGACANGNGTNAN 

ANNTNGTTNNA 

ANGNGCTATATCTTCGNNAGAAANACTGCTGCGCAGTGTGNAANAGCGTGNNTTCACGGTAT 
GNANGGNNGATNNNACTNTGCAGNAACTNCNA 

SEQIDN0774 

CTGTTGNTCTTTGGNCACATGATGATTCAGNTTGNNAAATNTGTGG 
SEQIDN07 75 

ATAGTAACGTGCCTCTTTGTTTCTGCNNTCAATTNGGCTANAGTCNAGTGGAGTAACGCGTG 
NGCCATTNTTNTNGAAGCTGTCGG 

SEQIDN07 7 6 

NTTTATGCCGGAANAAAGNNAGGCNAGNATGCAGATGCNGGNNACATAACGCTAATATGNGG 
ATGAATNAGGACNAGCAGCAGTGAAACTCCTTCCC 

SEQIDN0777 ^ ^ 

NGAGTNAAGGGCCANTCTGAATNTGGCCTAATNTGGNTAAANNGNGGGGAGTANGCCGNACA 

NANTNATTCTTGTGGNTGGNNNNNCGTTNA 
SEQIDN0778 

CTGATATGGGGATTNNGAGGCAAGGGGTATGGGGNATCATGAAGNTGGTTGCAG 
SEQIDN07 7 9 

GANNAGGCGCTCCCTCCTTNCTTTGTGATGACANCNATNGAANGAGAAGACTCCTA 
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SEQIDNO780 ' 

GAAGCATAGCCCNGCGCNGNTNGCGTNAATGAGANCACAGATGGNNCTAAAANATGANTGNT 



SEQIDN0781 

CCGCCTANTGCCTGTTAAGTCTAGCAACCTCCTCNAGAGTTNGGGAATTCACAATGGCAGCC 
SEQIDN0782 

GTANGGCCGAGTNAANGGTAGCAGAACTTNGAATGTGGGACNNGAGNGTACAANGCGTCNGA 
CANNGACTTNGTGTANANNC 

SEQIDN07 8 3 

GGNAGCGCTAGATGANCAAGACACAATTGATATGCAGTCTTAGGAANCTAGAGAGAGANTGT 
AGANTANGGTGATGAACGCACNTNGG 

SEQIDN0784 

TATTTNCCTGCGTGACCTAGTAAANATNGATAGGCCTCNANAGGTGGGGTTANTNAGGNCTC 
AT C AAT N C C NAG AC C C AAAT C AG G C AAT C 

SEQIDN07 8 5 

AAGCNGANNGACCTGTNTTGCACCTNAATATCCNNAGCCAAGGAAGANNGACGNTGGCTGGA 
TGANNNCAATNCTTNNANNAACCANNTACTGNCCN 

PRIMERS 

SEQIDN078 6 
CTCGTAGACTGCGTAGT 

SEQIDN07 87 

GAT C AC T AC G C AG T C T AC 

SEQIDN0788 

G AC GAT GAG T C C T GAG 

SEQIDN0789 

T AC T C AG G AC T CAT 

SEQIDNO7 90 
GACTGCGTAGTGATCNNN 

SEQIDN07 91 
GATGAGTCCTGAGTAANN 
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